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1  Introduction 

Between  the  well-studied  areas  of  discontinuous  control  [1],  [2]  on  the  one  hand 
and  sampled  data  control  [3]  on  the  other  lies  the  largely  unexplored  area  of 
logic-based  switching  control  systems.  By  a  logic-based  switching  controller  is 
meant  a  controller  whose  subsystems  include  not  only  familiar  dynamical  com¬ 
ponents  {integrators,  summers,  gains,  etc.}  but  logic-driven  elements  as  well 
[4]}-  More  often  than  not  the  predominately  logical  component  within 
such  a  system  is  called  a  supervisor  [5],  a  mode  changer  [6],  a  gain  sched¬ 
uler,  or  something  similar.  Within  the  last  decade  a  number  of  analytical  stud¬ 
ies  of  such  systems  have  emerged,  mainly  in  the  area  of  self-adjusting  control 
[7,  8,  9,  10,  11,  12,  13,  14,  15,  16].  These  studies  and  others  have  shown  that 
much  can  be  gained  by  using  logic-based  switching  together  with  more  familiar 
techniques  in  the  synthesis  of  feedback  controls.  The  overall  models  of  systems 
composed  of  such  logics  together  with  the  processes  they  are  intended  to  control 
are  concrete  examples  of  hybrid  dynamical  systems  [17,  18,  19].  The  aim  of  this 
paper  is  to  give  a  brief  tutorial  review  of  four  different  classes  of  hybrid  systems 
of  this  type  -  each  consists  of  a  continuous- time  process  to  be  controlled,  a  pa¬ 
rameterized  family  of  candidate  controllers,  and  an  event  driven  switching  logic. 
Three  of  the  logics,  called  prerouted  switching,  hysteresis  switching  and  dwell- 
time  switching  respectively,  are  simple  strategies  capable  of  determining  in  real 
time  which  candidate  controller  should  be  put  in  feedback  with  a  process  in  order 
to  achieve  desired  closed-loop  performance.  The  fourth,  called  cyclic  switching, 
has  been  devised  to  solve  the  long-standing  stabilizability  problem  which  arises 
in  the  synthesis  of  identifier-based  adaptive  controllers  because  of  the  existence 
of  points  in  parameter  space  where  the  estimated  model  upon  which  certainty 
equivalence  synthesis  is  based,  loses  stabilizability. 

In  section  2,  we  discuss  several  basic  issues  common  to  supervised  control 
systems  of  all  types.  In  most  cases  of  interest,  the  job  of  a  supervisor  is  to 
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orchestrate  the  switching  of  a  sequence  of  candidate  controllers  into  feedback  or 
series  with  a  process,  so  as  to  achieve  some  prescribed  goal.  No  matter  what  the 
goal  might  be,  the  underlying  architecture  of  the  supervised  control  system  is 
pretty  much  the  same  -  at  least  in  concept.  In  section  2  we  make  the  point  that 
such  “multi-controller'’  architectures  can  usually  be  implemented  most  efficiently 
as  “state-shared”  parameter-dependent  controllers. 

In  section  3  we  briefly  discuss  two  examples  of  logic-based  switching  con¬ 
trollers  which  arise  in  nonadaptive  applications.  The  first  is  an  ‘intelligent’  con¬ 
trol  strategy  devised  to  maximize  system  performance  while  at  the  same  guar¬ 
anteeing  that  hard-bound  saturation  constraints  are  satisfied  [20].  The  second 
is  a  simple,  time-invariant,  chatter- free,  switching  logic  with  one  state  variable, 
which  is  capable  of  asymptotically  stabilizing  a  particular  bilinear  system  of 
current  interest  called  the  “nonholonomic  integrator”  [21]. 

The  a:im  of  §4  is  to  explain  the  concepts  of  prerouted,  hysteresis,  dwell-time, 
and  cyclic  switching.  Although  each  of  these  strategies  is  applicable  to  a  variety 
of  systems  [11,  12,  13,  14,  22,  16,  23],  for  the  sake  of  uniformity  all  are  reviewed 
within  the  context  of  a  single  prototype  problem  -  the  set-point  control  of  a 
siso  linear  system  with  large-scale  parametric  uncertainty  [24].  The  problem  is 
formulated  in  §4.1. 

The  concept  of  “prerouted  switching”  is  closely  allied  with  the  idea  of  a 
“nonestimator  based  supervisor”;  both  topics  are  discussed  in  §4.2. 

Section  4.3  focuses  on  the  idea  of  an  estimator-based  supervisor.  It  is  within 
this  context  that  the  concepts  of  hysteresis  switching  and  dwell-time  switching 
are  explained.  The  idea  of  cyclic  switching  is  then  reviewed  in  section  4.4 

The  logics  discussed  in  §4  are  conceptually  straight  forward.  What’s  inter¬ 
esting  about  them  theoretically  is  the  set  of  technical  questions  they  generate. 
Most  of  the  questions  have  to  do  with  dynamical  systems  in  which  switching  is 
non-terminating,  non-chattering  and  asynchronous.  Many  unanswered  questions 
exist.  Some  are  briefly  discussed  in  §5. 

2  Multi-Controllers 

Perhaps  the  simplest  .architecture  one  can  think  of  for  a  feedback  system  em¬ 
ploying  a  family  of  controllers  is  that  depicted  in  Figure  1.  That  is,  the  measured 
output  y  of  a  process  to  be  controlled  drives  a  bank  of  controllers,  each  controller 
generating  a  candidate  {possibly  vector- valued]  feedback  signal  Ui.  The  control 
signal  applied  to  the  process  at  each  instant  of  time  is  then 

U  = 

where  t]  :  [0,  oo)  — >  J  is  a  piecewise-constant  switching  signal  taking  values  in  the 
family’s  index  set  X.  The  generation  of  such  a  switching  signal  is  typically  carried 
out  by  some  type  of  hybrid  dynamical  system  which  depending  on  the  situation 
might  be  called  a  tuner,  a  supervisor,  a  mode-changer,  or  something  similar.  In 
the  sequel  we  shall  refer  to  such  architectures  informally  as  multi-controllers. 
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Multi-Controllers 
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Fig.  1.  Multi-Control 


Many  multi-controller  configurations  can  be  implemented  using  a  much  sim¬ 
pler  architecture  than  Figure  1  would  suggest.  The  key  factor  which  makes  this 
possible  is  simply  that  at  any  instant  of  time  only  one  of  the  constituent  con¬ 
troller  is  to  be  applied  to  the  process.  Because  of  this,  at  each  time  t  it  is  only 
necessary  to  generate  one  candidate  control  signal.  Often  this  means  significant 
simplification  can  be  achieved  if  all  control  signals  are  generated  by  a  single 
system.  In  other  word,  rather  than  implementing  each  of  the  controllers  in  the 
family  as  a  separate  dynamical  system,  one  can  often  achieve  the  same  end  using 
a  single  controller  with  adjustable  parameters.  The  idea  is  quite  straight  forward 
and  is  called  state  sharing. 

For  example  suppose  that  it  is  desired  to  implement  a  finite  {or  even  count¬ 
able}  family  of  siso  linear  controllers  with  reduced  transfer  functions 


Ki[s)  = 


0i{s)  ’ 


i  G  I 


where  each  /?,•  (s)  is  a  monic  polynomial.  Assuming  a  fixed  upper  bound  n  for  the 
McMillan  Degrees  of  the  /Ci(s),  it  is  always  possible  to  “cover”  this  family  with 
a  parameter-dependent  transfer  function  hq{s)  whose  denominator  is  of  degree 
n  and  whose  parameter  vector  q  takes  values  in  a  linear  space  of  dimension  not 
exceeding  2n-l-l.  In  fact,  for  any  positive  integer  n  <  2n-|- 1,  it  is  always  possible 
to  pick  a  subset  Q  C  IR^  with  the  same  cardinality  as  Z,  and  a  parameter- 
dependent  transfer  function  hg{s)  so  that  for  each  i  G  1  there  is  a  E  Q  such 
that  Ki{s)  =  hq^{s)  after  cancellation  of  common  poles  and  zeros.  Moreover  it 
is  always  possible  to  choose  hq{s)  in  such  a  way  that  whenever  such  pole-zero 
cancellations  occur,  they  occur  at  prescribed  stable  locations. 

Having  constructed  such  an  hq{s)^  the  above  multi-controller  can  be  imple¬ 
mented  as  a  parameter  dependent  system  Ec  (cr)  of  the  form 


x'c  “  A^xc  T  bay 


(1) 


Uo  =  faXC  +  gaV 


(2) 


9 


where  {Aq,  hg,  fg,  g^j]  is  a  7?.-dimensional  realization  of  hg{s)  and  a  is  a  piecewise 
constant  switching  signal  taking  values  in  Q.  The  resulting  rnuhTcontrol  system 
would  then  appear  as  in  Figure  2. 


Fig.  2.  State-Shared  Multi- Controller  Implementation 


If  the  supervisor  is  allowed  to  re-initialize  state  at  switching  times,  then 

this  implementation  can  generate  exactly  the  same  feedback  control  signal  as 
would  have  been  generated  had  the  original  architecture  been  employed. 

For  multi-controller  families  consisting  of  more  than  just  a  few  controllers, 
this  state-shared  implementation  is  clearly  a  lot  less  complicated  than  a  direct 
implementation  of  the  original  multi-controller.  Moreover  state-sharing  frees  one 
from  having  to  be  concerned  about  the  boundedness  of  the  out-of-loop  control 
signals  which  would  be  present  in  a  direct  implementation  of  the  original  multi- 
controller  architecture. 

There  are  of  course  a  great  many  different  ways  to  realize  hg{s).  The  only 
essential  requirement  of  any  such  realization  is  that  it  be  a  “globally  detectable, 
globally  stabilizable”  system;  i.e.  for  each  fixed  value  of  g  G  Q,  the  linear  sys¬ 
tem  {Ag,bg,  fq,  gg})  shouM  be  stabilizable  and  detectable^.  For  without  stabi- 
lizability,  closed-loop  boundedness  of  u  and  y  cannot  be  assured  and  without 
detectability  boundedness  of  state  cannot  be  assured  even  if  u  and  y  are. 
One  familiar  structure  which  is  globally  detectable  is  of  the  form 


A  0 
0  A 


T 


fq  5 


fg  5  ^g 


where  (A,  6)  is  a  parameter-independent,  n-dimensional  siso,  controllable  pair 
with  A  stable.  Another  is  {A -f  kgf,bg,  f,dg}  where  (f,A)  is  an  n-dimensional, 
parameter-independent  observable  pair.  This  particular  realization  is  actually 
observable  for  all  g  G  IR^;  moreover  in  the  event  that  dg  is  constant  on  Q,  this 

^  The  reader  should  recognize  that  any  such  parameter-dependent  system  will  ahvays 
have  points  in  Q  at  which  it  is  not  controllable  and  observable  if  the  transfer  functions 
being  realized  are  not  all  of  the  same  McMillan  Degree. 


3  Examples 
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realization  guarantees  that  there  will  be  a  “bumpless”  transfer  between  control 
signals  when  a  switches;  i.e.,  u  ^  is  continuous,  even  at  those  times  at  which 
cr  changes  values.  Of  course  bumpless  transfer  can  also  be  achieved  with  state- 
reinitialization,  whether  dq  is  constant  on  Q  or  not. 

It  is  fairly  clear  that  the  preceding  ideas  apply  to  multi-controller  families  of 
mimo  finite  dimensional  controllers  configured  in  almost  any  way  imaginable.  It 
is  also  clear  that  the  number  of  (fixed-parameter)  controllers  one  might  contem¬ 
plate  implementing  in  a  particular  multi-controller  application  need  not  be  finite 
nor  even  countable.  In  other  words  the  complexity  of  a  multi-controller  is  not  so 
much  a  function  of  a  number  of  controllers  in  a  family  as  it  is  of  the  number  of 
algebraically  independent  gains  needed  to  parameterize  the  family. 


3  Examples 

In  the  sequel  are  several  examples  of  {nonadaptive}  logic-based  switching  con¬ 
trollers. 

3.1  Smart  Governors 

An  important  problem  of  continuing  interest  is  that  of  developing  feedback  con¬ 
trollers  for  linearly  modeled  processes  whose  associated  inputs  and  outputs  are 
required  to  satisfy  hard-bound  magnitiude  constraints.  Remarkable  advances 
have  recently  been  made  in  the  development  of  implementable  algorithms  for 
the  stabilization  of  such  systems  [25,  26].  At  the  same  time  there  has  also  been  a 
growing  interest  in  the  development  of  “smart  controllers”  employing  logic  aimed 
not  only  at  maintaining  loop  stability,  but  at  enhancing  system  performance  as 
well  [27,  28,  29,  20].  One  configuration  characteristic  of  this  line  of  research  is 
as  follows. 


Here  Ep  represents  a  linear  process  with  an  input  saturation  constraint,  is 
an  adjustable  gain,  and  k  is  a  linear  controller.  The  idea  is  to  design  n  to  meet 
performance  specifications  in  the  absence  of  saturation  constraints;  this  is  done 
for  the  case  ga  —  1.  The  error  governor  is  designed  to  adjust  g's  value  to  give 
the  best  performance  possible  subject  to  the  requirement  that  the  saturation 
constraints  are  satisfied.  This  is  accomplished,  roughly  speaking,  by  leaving^  set 
at  1  whenever  r  is  ‘smalE  and  by  reducing  ^’s  value  when  r  is  'large’  by  as  much 
as  is  required  to  insure  that  there  is  no  saturation.  The  error  governor  which 


74 


accomplishes  this  is  a  logical  circuit  which  carries  out  the  required  computations 
in  real  time.  A  generalized  {discrete-time}  version  of  the  preceding  with  greatly 
reduced  computational  requirements  has  been  proposed  in  [27]. 

An  even  more  elaborate  multi-controller  architecture,  aimed  at  a  similar 
problem  has  been  suggested  in  [20].  The  problem  addressed  is  to  bring  to  zero 

from  an  admissible  start,  the  state  x  —  {xp^xc}  of  the  system  T’((t)  depicted 
in  the  following  figure  while  not  violating  a  set  of  prespecified  state  constraints 
along  the  way. 


Associated  with  each  fixed  control  index  ^  G  Q  is  a  maximal  admissible  set  Sg. 
A  state  is  in  Sg  just  in  case  each  point  on  the  closed-loop  trajectory  of  T'(g) 
emanating  from  xq,  satisfies  the  aforementioned  state  constraints.  According  to 
[20],  it  is  possible  to  use  the  theory  of  maximal  output  admissible  sets  [30]  to 
design  controllers  so  that  Sg  C  Sg-\.l  and  in  addition  so  that  controller  q  achieves 
better  performance  than  controller  q  +  I  when  the  system  is  initialized  at  a 
state  in  Sg  OSg^i .  In  [20]  it  is  then  explain  how  to  construct  a  supervisor  which 
successively  switches  a  to  smaller  and  smaller  values  to  as  to  achieve  better  and 
better  performance  while  satisfying  state  constraints. 


3.2  Nonholonomic  Integrators 

For  more  than  a  decade  it  has  been  known  that  there  are  nonlinear  systems  which 
are  locally  null  controllable  but  which  nevertheless  cannot  be  locally  asymptot¬ 
ically  stabilized  with  any  smooth,  time-invariant  controller  [31].  A  prototypical 
example  of  this  is  the  bilinear  system 


X  =  u 
y  =  V 

z  —  XV  —  yu 

which  is  sometimes  called  the  “nonholonomic  integrator”  [32].  Nonholonomic 
systems  such  as  this  have  evoked  considerable  interest  in  recent  years  [33].  This 
has  been  especially  true  of  the  nonholonomic  integrator  itself.  For  example,  a 
number  of  time-varying,  periodic  controllers  have  been  devised  which  asymp¬ 
totically  stabilize  the  above  system  {cf.  [32]}.  In  addition,  by  appealing  to  the 
theory  of  sliding  modes  [1],  it  has  been  recently  shown  that  the  simple  discon¬ 
tinuous  control  u  =  -X  +  ^(sign(z))  v  =  -y  -  a:(sign(z))  will  drive  x,  y,  and  z 
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to  zero  provided  one  admits  generalized  solution  in  the  sense  of  [34].  It  turns  out 
to  be  possible  to  achieve  asymptotic  stability  without  chattering  using  a  time- 
invariant  logic-based  switching  controller.  One  strategy  which  accomplishes  this 
uses  a  multi-controller  of  the  form 

[!l  ^9o{x,y,z) 


where 


9i  = 


1 

1 


x~{-yz 

y-xz 


93  — 


-X  +  yz 
-y-  xz 


and  (j  is  a  piece- wise  constant  switching  signal  taking  values  in  Z  =  {1, 2,  3,  4}. 
a  is  generated  by  a  supervisor  of  the  form 


(x,  y,z} 


w  = 

2  2 
X  +  y 

w 

where  Us  is  a  switching  logic  whose  input  is  w  and  whose  state  and  output 
are  both  a.  Us^s  definition  requires  one  to  pick  four  appropriately  structured 
overlapping  regions  q  El  which  together  cover  the  closed  positive  quadrant 

Q  =  {(ri,  r2)  :  7"!  >  0,  r2  >  0}  in  One  possible  set  of  regions  is 

TZi  =  {(ri,r2)  :  r2  <  27r(r2),  (ri,r2)  G  ^2} 

712  =  {(ri,r2)  ;  7r(ri)  <  r2  <  47r(ri),  (ri,r2)  G  O} 

:  »*2  >  37r(ri),  (ri,r2)  G  O} 

714  =  {{0,0)} 

where  7r(ri)  4  (1  —  Z's’s  internal  logic  is  then  defined  by  the  computer 

diagram 


Qw 


=  min{^  :  w  Ellqi 


q  El} 


where 
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In  interpreting  this  diagram  it  is  to  be  understood  that  a  s  value  at  each  of  its 
switching  times  i  is  its  limit  from  above  as  t  it.  Thus  if  ti  and  ti^i  are  any  two 
successive  switching  times,  then  a  is  constant  on 

It  can  be  shown  that  with  this  switching  logic,  chattering  cannot  occur  and 
that  x.y  and  ^  must  tend  to  zero  no  matter  how  they  and  a  are  initialized  [21]. 
It  can  also  be  shown  that  the  origin  x  =  y  =  z  —  0  is  ‘Lyapunov  stable’  in  an 
appropriately  defined  sense.  We  refer  the  reader  to  [35]  for  a  different  application 
of  a  switching  logic  similar  to  the  one  we’ve  been  discussing. 


4  Self-Adjusting  Control 

The  aim  of  this  section  is  to  give  a  brief  tutorial  overview  of  four  different 
classes  of  logic-based  switching  control  systems  -  each  consists  of  a  continuous¬ 
time  process  to  be  controlled,  a  parameterized  family  of  linear  controllers,  and 
an  event  driven  switching  logic.  Three  of  the  logics,  called  prerouted  switching, 
hysteresis  switching  and  dwelhtime  sivitching  respectively,  are  simple  strategies 
capable  of  determining  in  real  time  which  controller  from  a  family  of  candidates 
should  be  put  in  feedback  with  a  process  in  order  to  achieve  desired  closed-loop 
performance.  The  fourth,  called  cyclic  switching,  has  been  devised  to  solve  the 
long-standing  stabilizability  problem  which  arises  in  the  synthesis  of  identifier- 
based  adaptive  controllers  because  of  the  existence  of  points  in  parameter  space 
where  the  estimated  model  upon  which  certainty  equivalence  synthesis  is  based, 
loses  stabilizability.  Although  each  of  these  strategies  is  applicable  to  a  variety  of 
problems,  the  sake  of  uniformity  all  are  explained  within  the  context  of  a  single 
prototype  problem  -  the  set-point  control  of  a  siso  linear  system  with  large-scale 
parametric  uncertainty  §4.1.  The  concept  of  prerouted  switching  is  closely  allied 
with  the  idea  of  a  “nonestimator  based  supervisor”;  both  topics  are  discussed 
in  §4.2.  Hysteresis  switching  and  dwell-time  switching  are  explained  in  §4.3  in 
connection  with  the  concept  of  an  estimator-based  supervisor.  Cyclic  switching 
is  discussed  in  §4.4 

4.1  The  Problem 

The  prototype  problem  we  want  to  consider  is  basic;  to  construct  a  control 
system  capable  of  driving  to  and  holding  at  a  prescribed  set-point,  the  output  of  a 
process  modeled  by  a  dynamical  system  with  large  scale  parametric  uncertainty. 
Assume  the  process  admits  the  model  of  a  siso  controllable,  observable  linear 
system  Up  with  control  input  u  and  measured  output  y.  Further  assume  that 
Ep's  transfer  function  from  w  to  y  is  a  member  of  a  known  class  of  admissible 
strictly  proper  transfer  functions  Cp,  In  view  of  the  requirements  of  set-point 
control,  assume  that  the  numerator  of  each  transfer  in  Cp  is  nonzero  at  s  =  0. 

The  specific  design  goal  is  to  construct  a  positioning  or  set-point  control 
system  capable  of  causing  y  to  approach  and  closely  track  any  constant  reference 
input  r.  Towards  this  end  we  introduce  a  tracking  error 


/i 

eT  =  r  ~  y 


(3) 
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and  an  integrating  subsystem  to  generate  u;  i.e., 


ii  =  V  (4) 

Here  v  is  a  control  signal  which  will  be  defined  in  the  sequel. 

As  our  concern  is  mainly  with  supervisory  control,  we  are  going  to  take  as 
given,  a  parameterized  family  of  proper,  reduced  controller  transfer  functions 

JC  =  {Kg  :  q  £  Q}  which  has  the  property  that  for  each  transfer  function  r  G  Cp, 
there  is  at  least  one  controller  transfer  function  k  E  JC  which  internally  stabilize 
feedback  interconnection  shown  in  Figure  3. 


Fig.  3.  Feedback  Interconnection 


The  sub-system  to  be  supervised  is  thus  of  the  form 


Fig.  4.  Supervised  Sub-System  T'(<t) 


where  Uc{q)  is  a  parameter-dependent,  globally  detectable/stabilizable  realiza¬ 
tion  of  Kg  with  state  xc-  In  the  sequel  we  shall  describe  various  types  of  super¬ 
visors  capable  of  generating  cr  so  as  to  at  least  achieve  set-point  regulation  {i.e., 
eT  0}  and  global  boundedness. 


4.2  Nonestimator- Based  Supervisor 

A  ‘nonestimator-based’  supervisor  is  a  hybrid  dynamical  system  whose  input  is 
a  suitably  defined  “tuning  error”  e  and  whose  output  is  cr. 


e 


supervisor 


a 


A  tuning  error  is  a  linear  {possibly  parameter-dependent}  function  of  the  mea¬ 
surable  signals  in  the  sub-system  Z’(cr)  shown  in  Figure  4.  The  key  requirements 
governing  the  selection  of  e  are  as  follows. 
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Timing  Error  Requirements: 

1.  For  each  fixed  q  e  Q,  S{q)  must  be  detectable  through  e. 

2.  For  each  constant  r  and  each  q  G  Q.  f  must  vanish  on  Ulq)  s  pfiuilibi  iuni 

state. 

The  global  detectability  requirement  is  fundamental.  Its  significance  has  been 
discussed  in  a  broader  context  in  [36]. 

One  definition  of  e  which  satisfies  both  requirements  for  the  problem  under 

consideration  is  r  i 

ex 


There  are  many  other  acceptable  choices  as  well. 

Assume  that  e  has  been  defined  so  that  the  preceding  requirements  are  sat¬ 
isfied.  The  sub-system  depicted  in  Figure  4  then  admits  a  state  space  model  of 
the  form 

e  —  Co  ^ 

^  —  AoX-\-bor  ('^) 

where  x  is  the  composite  state 


and  Ag,bq  and  C,  are  parameter-dependent  matrices  determined  by  the  def¬ 
inition  of  e  and  the  coefficient  matrices  of  Up  and  Ec-  The  position  of  the 
integrator  in  Figure  4  is  important  [24]:  its  location  guarantees  that  for  each 
fixed  r,  the  equilibrium  state  of  (5),  namely 

io  =  -AqHqr, 

is  independent  of  q  E  Q-  Because  of  this  and  the  assumption  that  e  satisfies 
Tuning  Error  Requirement  2,  it  is  possible  to  write 

e  =  CcrX 

X  =  AgX  (6) 

where  x  =  x  —  xq- 

What  we  want  to  do  is  to  explain  how  to  construct  a  supervisor  whose  output 
causes  a;  0  as  <  oo.  The  only  properties  of  (6)  which  we  will  exploit  are 

the  following: 

Properties  of  {Cq,  Aq)  : 

1.  There  exists  a  parameter  value  q*  E  Q  for  which  Aq-  is  a  stability  matrix. 

2.  (Cq,Aq)  is  detectable  for  each  q  €  Q- 
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The  first  property  is  a  consequence  of  the  assumption  that  for  each  transfer 
function  r  £  Cp  there  is  a  transfer  function  /c  E  /C  which  stabilizes  the  system 
shown  in  Figure  3.  The  second  property  follows  from  Tuning  Error  Requirement 
1. 

Apart  from  the  preceding,  nothing  is  assumed  about  (6)  other  than  that  e 
can  be  measured.  In  particular,  neither  the  parameter-dependent  pair  [Cq,Aq) 
nor  q*  are  presumed  to  be  known.  Of  course  with  so  little  known,  one  should 
not  expect  to  come  up  with  a  supervisor  worthy  of  actual  implementation  unless 
perhaps  Q  is  a  finite  set  with  a  small  number  of  elements. 

Within  the  parameter-adaptive  framework  proposed  in  [36],  a  nonestimator- 
based  supervisor  would  be  called  a  “prerouted”  parameter  tuner.  All  parameter 
tuners,  be  they  prerouted  or  not,  are  based  on  the  same  underlying  strategy 
which  roughly  speaking  is  to  keep  adjusting  a  until  e  is  “small”  in  some  suitably 
defined  sense.  Although  there  are  a  great  many  different  methods  for  accom¬ 
plishing  this,  in  most  instances  tuning  is  carried  out  in  one  of  two  fundamentally 
different  ways  depending  on  whether  the  'path’  a  takes  in  Q  is  ‘prerouted’  or 
not.  For  the  prerouted  case,  tuning  is  achieved  by  moving  a  through  Q  along  a 
prespecified  path  or  route,  using  on-line  {i.e.,  real-time}  data  to  decide  only  if 
and  when  or  how  fast  to  change  cr  from  one  value  along  the  path  to  the  next.  In 
contrast,  for  the  non-prerouted  case,  the  path  in  Q  along  which  a  is  adjusted  is 
not  prespecified  off-line  but  instead  is  determined  in  real  time  from  the  values 
of  various  measured  signals. 

The  basic  idea  of  prerouted  tuning  was  devised  by  Martensson  with  the  ex¬ 
pressed  purpose  of  delineating  the  theoretical  limits  of  what  might  be  achieved 
with  any  adaptive  algorithm  [7].  Over  the  past  decade  many  refinements  and 
modifications  of  the  concept  have  appeared  [8,  9,  10,  37,  38].  Although  these 
modified  algorithms  differ  from  each  other  in  many  ways,  all  share  certain  un¬ 
derlying  features  in  common.  In  most  cases  prerouted  tuners  consist  of  the  Ccis- 
cade  connection  of  two  subsystems,  one  a  scheduling  logic  Us  and  the  other  a 
memory  less  map  h  :  {1,2,.. oo}  Q  called  a  routing  function. 


e 


7| 


a 


h  is  invariably  required  to  have  the  revisitation  property:  That  is,  for  any  g  E  Q 
and  any  positive  integer  i  there  must  exist  an  integer  j  >  i  at  which  h{j)  =  q. 
In  other  words,  h  must  have  the  property  that  the  prerouted  path  h{l)/h{2), . . . 
revisits  {i.e.,  passes  through}  each  point  in  Q  infinitely  often.  For  this  to  be 
possible,  Q  must  clearly  be  at  least  a  countable  set^.  Assuming  this  to  be  the  case, 
it  is  always  possible  to  define  a  routing  function  with  the  revisitation  property. 
One  way  to  do  this  is  as  follows. 

^  Actually  in  Martenson’s  original  work  Q  is  a  continuum  and  the  elements  of  the  se¬ 
quence  h(l),/i(2), ...  are  only  required  to  get  close  to  {rather  than  equal}  previously 
visited  ones  [39]. 
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1.  If  Q  =  {^1, 72,  •  •  • ,  ^/rn}  is  a  finite  set,  define  h  to  be  the  in  periodic  function 
whose  first  m  values  are  71, 72,  •  •  • ,  and  7^  respectively. 

2.  If  Q  ~  {71, 72,  •  •  ■ ,  }  is  not  a  finite  set,  define  h  be  the  function  whose  se¬ 
quence  of  values  /^(  1).  fi(2), .  .  .  are  the  elements  of  tlie  seciuence  71 . 71,  72,  71, 72, 

There  are  many  possible  ways  to  define  Es,  depending  what  one  is  trying  to 
accomplish.  For  illustrative  purposes,  we  shall  take  Es  to  be  a  hybrid  dynamical 
system  whose  input  is  e  and  whose  output  is  a  piecewise-constant  switching 
signal  T]  taking  values  in  the  set  of  positive  integers.  Ts’s  state  consists  of  four 
variables  -  a  timing  signal  r,  a  piecewise-continuous  'performance  signal’  tt  and 
a  piecewise  constant  'sampled  performance  signal’  tt.  Both  tt  and  ^  take  values 
in  [0,  00).  Timing  signal  r  takes  values  in  the  closed  interval  [0,  where  tjj  is 
a  preselected  positive  number  called  a  dwell  time.  Ts’s  dynamics  are  defined  by 
the  following  computer  diagram. 


start 


The  functioning  of  Es  is  as  follows.  During  the  first  td  time  units  after  the 
algorithm  is  initiated,  r  is  increased  linearly  from  0  to  To  using  a  reset  integrator 
and  TT  is  increased  from  0  according  to  the  rule 

(7) 

Just  at  the  end  of  this  period,  r  is  reset  to  zero,  the  reset  integrator  is  turned 
off,  and  TT  is  set  equal  to  the  present  value  of  tt.  So  long  as  tt  remains  less  than  or 
equal  to  cttf,  the  updating  of  tt  continues  according  to  (7).  If  and  when  tt  becomes 
greater  than  t^tt,  t]  is  incremented  by  1,  tt  and  ft  are  reset  to  zero,  and  the  entire 
process  is  repeated.  Note  that  the  time  between  any  two  successive  switchings  of 
7]  can  never  be  smaller  than  r/>.  Said  differently,  i]  "dwells”  at  each  of  its  values 
for  at  least  ro  time  units.  Because  of  this,  infinitely  fast  switching  cannot  occur 
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so  existence  and  uniqueness  of  solutions  to  the  differential  equations  involved  is 
not  an  issue. 


Analysis:  There  is  a  fairly  straight  forward  way  to  go  about  analyzing  the  type 
of  supervisory  control  system  weVe  just  described.  The  key  step  is  to  prove 
that  switching  stops  in  finite  time.  In  particular,  for  the  problem  at  hand  the 
trick  is  to  show  that  there  must  be  a  finite  integer  depending  only  on  the  pair 
(Cg,  Aq)  and  not  on  the  initial  value  of  ar,  which  r]  cannot  exceed  in  value.  Before 
we  address  this  claim,  let  us  consider  its  consequences.  What  the  claim  implies 
is  that  Tj  can  switch  at  most  a  finite  number  of  times  and  therefore  that  there 
must  be  a  time  t  beyond  which  rj  is  constant  and  tt  <  77^.  The  latter  assures  that 

e(s)|pds  <  fjn,  t  >i 

and  thus  that  e  has  a  finite  £^[0,  00)  norm.  Moreover  since  (Cg,  Ag)  is  detectable 
on  Q  and  y  is  fixed  at  some  value  q  E  Q,  {Cg,Ag)  is  detectable^.  As  a  conse¬ 
quence,  for  t  >i  it  is  possible  to  rewrite  (6)  as 


X  =  {Ag  +  KCg)x  -  Ke 

where  K  is  any  matrix  which  stabilizes  Ag  -j-  KCg.  Therefore  x  ^  0  since  e  has  a 
finite  £^[0,  00)  norm.  In  fact,  because  x  =  AgX  is  a  time  invariant  linear  system, 
X  must  go  to  zero  as  fast  at  -A  being  the  largest  of  the  real  parts  of  Ag's 

stable  eigenvalues.  In  other  words,  to  prove  that  x  ^  0  {and  consequently  that  x 
has  a  finite  limit  and  that  ct  — 0}  it  is  enough  to  show  that  there  is  an  integer 
f]  which  77  cannot  exceed  in  value. 

Here  briefly  is  the  idea.  Let  p  be  any  point  in  Q  at  which  Ap  is  a  stability 
matrix.  If  Cp  =  0,  let  77  be  the  first  positive  integer  such  that  h{7i)  =  p.  Then 
either  there  is  an  interval  of  maximal  length  on  which  77  =  ^  or  77  never 

gets  as  large  as  77.  If  the  latter  is  true,  then  we  are  done.  On  the  other  hand,  if 
the  former  is  true,  then  tt  =  0  on  so  for  such  t,  n  <  rjir.  Because  of  1:5 ’s 

definition,  this  means  that  no  more  switching  can  occur,  that  =  00  and  thus 
that  77  can  grow  no  larger  than  77. 

Now  suppose  Cp  0.  Reduce  (Cp.Ap)  to  an  observable  pair  (C,  A)  by  picking 
any  full  rank  matrix  R  whose  kernel^  is  the  unobservable  space  of  {Cp,Ap)  and 
solving  the  jinear  equations  Cp  =  CR,  RAp  ~  AR  for  C  and  A  respectively. 
Note  that  A  must  be  a  stability  matrix  because  Ap  is. 

Let  G{t)  denote  the  observability  Gramian 


e^^CCe^^ds 


Note  that  G(oo)  must  exist  because  of  A’s  stability.  Moreover,  G{td)  must  be 
positive  definite  because  of  the  observability  of  (C,  A).  This  implies  that 


A 

p  zz  sup 

X 


x^G{oo)x 

x'G{rD)x 


<  00 


^  There  is  of  course  no  reason  to  assume  that  Aq  is  a  stability  matrix. 
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and  that 

G(^do)  <  tiGiro)  (8) 

Now  let  Tjp  be  the  least  integer  no  smaller  than  //  for  which  —  p.  Because 

of  the  revisitation  property,  Pp  must  necessarily  exist.  In  view  of  (8) 

G(oo)<ppG{To)  (9) 

We  claim  that  p  ^  pp  has  the  desired  property.  To  prove  that  this  is  so,  we 
may  as  well  assume  that  there  is  an  interval  on  which  p  —  p.  For  if  this  were  not 
the  case  then  a  could  not  exceed  p  and  we  would  be  done. 

Let  [^0,  ^i)  denote  the  largest  interval  on  which  p  —  p.  For  t  G 
a  —  h{p)  —  p  and 

;r(<)  =  r  IlCpe^C— <  r  =  ||yGMi?x’(io)||“ 

Jto 

From  this,  (9)  and  the  definitions  of  p  and  n  it  follows  that  for  i  G  [^oTi), 

<  VqWV  G{TD)Rx{to)  I  p  =  P  f  \\Ce^^  Rx{to)\\‘^ds  =  pir 

Jo 

Thus  because  of  Ifs’s  definition,  no  more  switching  can  occur,  =  oc  and  thus 
p  can  grow  no  larger  than  p.  M 

There  are  many  provably  correct  versions  of  the  algorithm  we’ve  just  analyzed 
[7,  8,  9,  10,  37,  38].  All  employ  a  tuning  error  satisfying  the  aforementioned 
requirements,  a  performance  signal,  a  routing  function  and  a  switching  logic 
similar  to  the  one  we’ve  described.  Usually  td  is  an  increasing  function  of  p 
rather  than  a  constant.  In  most  cases,  the  proof  technique  employed  relies  on 
the  cessation  of  switching  in  finite  time.  The  selection  of  tt  and  the  definition  of 
Us  are  made  to  insure  that  this  is  so. 

Although  nonestimator  based  supervisors  are  prerouted  tuners,  the  converse 
is  not  necessarily  true.  For  example,  it  is  quite  possible  for  a  supervisor  employing 
“estimators”  to  use  prerouted  tuning  to  generate  cr.  Supervisors  admitting  this 
structure  have  in  fact  been  studied  in  [40].  This  reference  actually  examines  the 
convergence  properties  of  a  variety  of  estimator-based  switching  logics. 

The  findings  of  [40]  and  earlier  work  clearly  suggest  that  some  of  the  concepts 
we’ve  covered  here  have  a  universal  character  and  may  well  be  extendable  to 
significantly  broader  classes  of  problems  than  have  been  considered  so  far.  In 
the  sequel  we  briefly  summarize  some  preliminary  thoughts  along  these  lines. 

Generalization:  Let  Q  be  a  countable  set.  Suppose  that  for  each  g  G  Q,  Aq  : 
IRA  IR”  is  a  smooth,  possibly  nonlinear  function  and  that  for  some  q"  E  Q, 
the  zero  state  of 

X  —  Aq*  (x) 

is  a  globally  asymptotically  stable  equilibrium.  Assume  that  for  each  piecewise 
constant  switching  signal  a  :  [0,  oo)  Q,  all  solutions  to  the  differential  equation 

X  ^  Aa{x)  (10) 
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exist  on  [0,  oc).  Our  aim  is  to  briefly  outline  how  one  might  go  about  constructing 
a  nonestimator  based  supervisor,  not  depending  on  q*  or  precise  knowledge  of 
the  Aq^  which  cause  all  “supervised  ”  (i.e.,  closed-loop)  solutions  to  (10)  to  tend 
to  zero  as  t  oo. 

Suppose  it  is  possible  to  construct  a  smooth  function  b  :  ^  IR  such  that 

P,(^)ll<l|6WII.  VxeiR",  qeQ  (11) 


and  for  some  q  E  Q 


where  0  :  [0,  oo)  x  IR"  IR"  is  the  flow  of 

X  =  Aq(x) 


(12) 


(13) 


initialized  at  z.  Requirement  (11)  is  relatively  mild  and  can  typically  be  satisfied 
without  precise  knowledge  of  the  Ag.  Implicit  in  (12)  is  the  requirement  that  the 
zero  state  of  (13)  is  {at  least}  an  asymptotically  stable  equilibrium;  in  fact,  for 
the  requirement  to  make  sense  as  it  stands,  all  solutions  to  (13)  would  have  to 
have  finite  £^[0,oo)  norms. 

We  claim  that  the  supervisor  we’ve  already  described  will  accomplish  the 
prescribed  task  provided 


A 

e  — 


b{x) 


(14) 


The  reasoning  upon  which  this  claim  is  based  is  as  follows. 

First  of  all  note  that  satisfaction  of  (12)  guarantees  that  rj  cannot  exceed 
the  least  integer  fj  no  smaller  than  fi  for  which  h{fi)  =  q.  The  argument  which 
justifies  this  assertion  exploits  the  inequality 


<p{t~to,z) 


dt 


rto 

<v  / 

Jto 


^O  +  T’D 


<j){i 


^  0  ^  b  j  2^  G  IR^ 


and  is  essentially  the  same  as  before.  The  inequality  is  a  consequence  of  (12). 

At  this  point  we  need  a  good  working  definition  of  detectability  for  nonlinear 
systems.  Suppose  we  agree  to  call  a  smooth  dynamical  system  of  the  form 


X  —  A[x) 

e:=^C[x)  (15) 


detectable  if  there  exists  a  positive  definite,  radially  unbounded,  continuously 
differentiable  function  V  :  IR"  IR  such  that 


— yl(a;)-||C-(a;)|p<0,  x  €  IR",  a;  ^  0 


(16) 
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-  The  definition  characterizes  detectability  more  as  a  generalization  of  stability 
than  of  observability;  note  for  example,  that  if  C(x)  =  x,  (15)  may  not  satisfy 
the  definition,  even  though  for  this  example  (15)  would  certainly  have  to  be 
considered  an  observable  system. 

-  In  the  linear  case  when  C(x)  =  Cx  and  *4(.r)  =  Ax.  the  standard  definition  ol 
detectability  is  known  to  be  equivalent  to  the  existence  of  a  positive  definite 
matrix  P  which  satisfies  the  matrix  inequality  PA  +  A' P  —  C'C  <  0  [41]; 
since  V  =  x^Px  satisfies  (16),  the  definition  of  detectability  proposed  here 
thus  has  the  virtue  of  reducing  to  the  standard  one  in  the  linear  case. 

-  It  can  be  easily  shown  that  if  (15)  is  a  detectable  nonlinear  system  and  e 
has  a  finite  £^[0,  oo)  norm  along  some  solution  x,  then  x  must  tend  to  zero. 
Thus  the  proposed  definition  fulfills  the  intuitively  appealing  requirement 
that  smallness  of  the  output  of  a  detectable  system  ought  to  imply  smallness 
of  the  system’s  state. 

Returning  to  our  problem  we  point  out  that  (11)  implies  that  for  each  fixed 
q  G  Q,  the  dynamical  system 


X  -  Ag(x) 


e  = 


X 

b{x) 


(17) 


is  detectable  through  e.  This  can  be  verified  using  the  function  V  = 

The  steps  involved  in  showing  that  x  — ^  0  are  clear.  Since  switching  stops, 
TT  is  bounded  which  means  that  e  must  have  a  finite  £“[0,  oo)  norm.  Suppose  q 
is  cr’s  final  value.  Then  (17)  governs  the  evolutions  of  x  and  e  after  switching 
stops.  Because  (17)  is  a  detectable  system  and  e  has  a  finite  £^[0,oo)  norm,  x 
must  tend  to  zero  as  claimed. □ 

There  are  of  course  plenty  of  practical  reasons  why  one  would  not  want  to 
seriously  consider  implementing  the  system  just  described.  On  the  other  hand, 
there  are  components  of  the  preceding  {e.g.,  the  notion  of  detectability  and  how 
to  use  it}  which  will  no  doubt  prove  useful  in  the  analysis  of  more  meaningful 
algorithms. 

One  drawback  of  many  “switched”  control  systems  including  the  ones  we’ve 
discussed  so  far,  is  that  they  make  use  of  signal  which  grows  monotonically 
with  time.  For  the  supervisor  we’ve  described  this  would  be  ??.  Since  bounded 
monotone  signals  converge,  switched  systems  which  employ  them  tend  to  be 
fairly  easy  to  analyze.  The  problem  is  that  when  bounded  noise  and  or 
exogenous  disturbances  signals  are  present,  monotone  signals  tend  to  blow  up. 
To  get  around  this,  it  is  generally  necessary  to  eliminate  monotone  signals  alto¬ 
gether,  usually  by  introducing  “forgetting  factors”  or  “exponential  weighting”  of 
some  form  [8,  38,  24].  What  this  means  is  that  with  such  modifications  in  place, 
switching  can  no  longer  be  expected  to  terminate  in  finite  time.  As  a  result  one 
is  usually  confronted  with  an  analysis  problem  which  is  very  much  more  chal¬ 
lenging  than  that  encountered  in  the  noise-free  case  when  monotone  convergence 


4  Self-Adjusting  Control 


85 


could  be  counted  on.  Because  of  this  there  is  a  specific  need  for  technical  results 
appropriate  to  the  analysis  of  systems  within  which  switching  never  terminates. 

'  Perhaps  the  most  serious  criticism  of  the  nonestimator  approach  is  its  reliance 
on  prerouted  tuning.  Clearly  if  Q  is  a  large  set,  one  should  not  expect  a  prerouted 
supervisory  control  system  to  perform  very  well. 


4.3  Estimator-Based  Supervisors 

The  overall  responsibility  of  any  multi-controller  supervisor  can  be  divided  into 
a  scheduling  task  -  deciding  when  to  switch  controllers  -  and  a  routing  task 
-  deciding  which  controller  to  switch  to  next.  Nonestimator-based  supervisors 
have  the  routing  question  decided  for  them  and  are  thus  designed  to  deal  only 
with  scheduling.  It  is  natural  to  expect  that  improved  overall  performance  can  be 
achieved  by  employing  a  supervisor  endowed  with  the  capability  of  making  both 
scheduling  and  routing  decisions  in  real  time.  An  important  class  of  supervisors 
possessing  this  capability  are  those  which  are  estimator-based.  Estimator-based 
supervisors  utilize  a  form  of  certainty  equivalence  and  as  such  are  in  some  ways 
quite  similar  to  conventional  estimator-based  tuners  encountered  in  parameter 
adaptive  control. 

Since  an  estimator-based  supervisor  is  responsible  for  both  scheduling  and 
routing,  it  is  not  surprising  that  defining  one  should  require  a  more  detailed 
description  of  Cp  then  we’ve  assumed  so  far.  For  illustrative  purposes  suppose 
Cp  to  be  of  the  form 

Cp  =  U  C(p) 

per 

where  P  is  a  closed,  bounded  {possibly  finite}  subset  of  a  real,  finite-dimensional 
linear  space.  Here  C{p)  denotes  the  subclass 

C(p)  =  W+rf:||<J|U<ep} 

where  i/p  is  a  preselected,  reduced,  strictly  proper  nominal  transfer  function,  Cp 
is  a  real  non-negative  number  and  J  is  a  stable,  strictly  proper  norm-bounded 
perturbation  representing  unmodelled  dynamics  of  the  additive  type;  ||  •  ||oo 
denotes  the  shifted  infinity  norm 

||(J||co=  sup  |(5(s)|, 
sec(A„) 

where  is  a  prespecified  positive  number  called  the  unmodelled  dynamics  sta¬ 
bility  margin,  and  C{Xu)  is  the  subset  of  the  complex  plane  consisting  of  all 
points  on  and  to  the  right  of  the  vertical  line  s  =  —  A^.  Assume  for  each  p  EV, 
that  the  allowable  values  of  S  exclude  transfer  functions  for  which  i/p+S  has  un¬ 
stable  poles  and  zeros  in  common.  All  transfer  functions  in  Cp  are  thus  strictly 
proper,  but  not  necessarily  stable  rational  functions. 

As  before,  we  take  as  given  a  parameterized  a  family  of  admissible  controller 
transfer  functions  JC  which  has  the  property  that  for  each  transfer  function  r 
in  Cp  there  is  at  least  one  controller  transfer  function  k  E  1C  which  internally 
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stabilizes  the  interconnection  shown  in  Figure  3.  F3ecause  estimator-based  su¬ 
pervisors  base  decision-making  on  the  idea  of  certainty  equivalence,  to  configure 
such  a  supervisor  it  is  necessary  to  first  specify  a  well-defined  function  F  from 
the  nominal  process  model  transfer  function  class  \J  =  {up  :  p  £  V]  to  K  in  such 
a  way  that  the  assignment  Up  i — y  F{iyp)  meets  prescribed  specifications,  (dven 

F,  a  natural  way  to  make  this  assignment  explicit  is  to  stipulate  that  V  be  a 

/\ 

subset  of  /C’s  parameter  space  Q  and  then  to  define  Kp  —  F{iyp)  for  each  p  EV. 
For  the  present  we  shall  actually  take  Q  .  The  reader  should  realize  however 
that  there  are  situations  in  which  it  is  advantageous  to  choose  Q  larger  than 
■p.  For  example,  picking  Q  larger  than  V  makes  it  possible  to  define  generalized 
supervisors  whose  controller  selection  strategies  are  not  based  just  on  certainty 
equivalence  alone  {c.f.  §4.4}. 

Assume  that  the  transfer  functions  in  JC  satisfy  the 
Stability  Margin  Requirement:  For  each  p  EV  the  real  parts  of  the  closed- 
loop  poles  of  the  feedback  interconnection  shown  in  Figure  5  are  less  than  —\s 
where  Xs  some  prespecified  positive  number  called  a  stability  margin. 


Fig.  5.  Feedback  Interconnection 

In  concept,  an  estimator-based  supervisor  can  be  explained  in  terms  of  the 
“multi-estimator”  architecture  shown  in  Figure  6. 


y 


Fig.  6.  Multi-Estimator  Configured  Supervisor 


where  each  pp  is  a  suitably  defined  estimate  of  y  which  would  be  asymptotically 
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correct  if  i/p  were  the  process  modehs  transfer  function.  For  each  p  £V, 

ep  =  yp-y  (18) 

denotes  the  pth  output  estimation  error;  7Vp  is  a  “normed”  value  of  Cp  or  a 
“performance  signal”  which  is  used  by  the  supervisor  assess  the  potential  per¬ 
formance  of  controller  p.  Us  is  a  switching  logic  whose  function  is  to  determine 
(7  on  the  basis  of  the  current  values  of  the  tt^. 

The  underlying  decision  making  strategy  used  by  an  estimator-based  super¬ 
visor  of  the  ‘non-prerouted’  type  is  basically  this:  From  time  to  time  select  for 
V,  that  candidate  control  signal  Vp  whose  corresponding  performance  signal  TTp 
is  the  smallest  among  the  p  eV.  What  makes  a  non-prerouted  supervisor 
such  as  this  distinctly  different  from  a  prerouted  one  is  thus  the  philosophy  un¬ 
derlying  the  method  it  uses  to  carry  out  its  task.  In  particular,  a  non-prerouted 
supervisor  decides  which  controller  to  put  in  the  feedback  loop,  not  by  search 
along  a  predetermined  route  in  )C,  but  rather  by  continuously  comparing  in  real 
time  suitably  defined  normed  output  estimation  errors  or  performance  signals 
associated  with  the  admissible  nominal  process  models.  Motivation  for  this  idea 
is  obvious:  the  process  model  whose  associated  performance  signal  is  the  small¬ 
est,  “best”  approximates  what  the  process  is  and  thus  the  candidate  controller 
designed  on  the  basis  of  that  model  ought  to  be  able  to  do  the  best  job  of  con¬ 
trolling  the  process.  The  origin  of  this  idea  is  of  course  the  concept  of  certainty 
equivalence  from  parameter  adaptive  control. 

By  an  estimator  of  y,  based  on  transfer  function  is  meant  a  linear  system 
of  the  form 

Xp  —  ApXp  -{-  dpy  “h  bpXi 
yp  —  ^p^p 

where  {Ap  -\-  dpCp^  bp,Cp}  is  a  realization  of  i/p  and  Ap  is  a  stability  matrix.  It  is 
easy  to  verify  that  any  such  realization  necessarily  fulfills  the  requirement  that  yp 
be  an  asymptotically  correct  estimate  of  y  if  the  process  model  transfer  function 
were  i/p.  Notice  that  such  realizations  are  invariably  detectable  because  of  ^p’s 
stability.  For  the  present  we  are  only  going  to  consider  realizations  which  are 
stabilizable  as  well,  even  though  by  doing  so  we  are  sidestepping  some  subtle  but 
important  issues{cf.,  §4.4}.  There  are  many  ways  to  construct  estimators  which 
meet  these  requirements.  For  example,  if  n  is  an  upper  bound  on  z/p’s  McMillan 
Degree,  yp  can  always  be  be  generated  by  an  observer-based  estimator  of  the 
form 

Xp  =  AoXp  dpy-\-bpU 

yp  —  ^o^p  (^1) 

where  {co^Aq)  is  an  n-dimensional,  parameter-independent  observable,  stable 
pair  and  {Aq  +dpCo,bp,  cq}  is  a  stabilizable  realization  of  z/p.  It  is  also  possible 
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to  generate 


using  an  identifier-based  estimator  of  the  form 


'Ai 

0  ■ 

'bi' 

'  0  ■ 

Xf  - 

0 

A I 

Xj  + 

0  _ 

y  + 

Vp  =  Cp.r/ 


where  [Ai.hi]  is  a  parameter-independent,  n-dimeiisional  siso,  controllable  pair 
with  A I  stable  and 


{\Aj  0 

U  0 


Cp 


is  a  stabilizable  realization  of  v^.  Note  that  the  state  of  this  estimator  is  inde¬ 
pendent  of  p,  whereas  the  state  of  the  observer-based  estimator  in  (21)  is  not. 
What  this  means  is  that  if  n  is  an  upper  bound  on  the  McMillan  Degrees  of  all 
of  the  nominal  transfer  functions  in  W,  then  all  of  the  yp  can  be  generated  using 
a  single  estimator  with  shared  state  xj  and  parameter-dependent  readout  map 

Cp  . 

There  is  a  third  way  to  generate  yp  which  is  very  similar  to  the  second  but 
which  is  especially  well-suited  to  the  set-point  control  problem  under  consider¬ 
ation.  In  this  case  one  uses  an  identifier-based  estimator  Ee  of  the  form 


XE  = 


Ae  0 
0  Ae 


xe  + 


0 


y  + 


0 

bE 


yp  -  cpXE 


(22) 

(23) 


where  (T^;,  6^)  is  a  parameter-independent,  (n+l)-dimensional  siso,  controllable 
pair  with  Ae  stable  and 


Ae  0 
0  Ae 


+ 


bs 

0 


'-'P  5 


0 

bE 


is  a  stabilizable  realization  of  ji/p,  A  state-shared  implementation  based  on  this 
estimator  would  then  appear  as  in  Figure  7.  Naturally  this  architecture  can  only 
be  implemented  as  it  stands  if  the  number  of  output  estimation  errors  is  finite; 
i.e.,  if  P  is  a  finite  set.  It  turns  out  however  that  such  a  supervisor  can  often 
be  implemented  using  a  simpler  architecture  -  one  which  permits  V  to  contain  a 
continuum  of  points.  To  explain  why  this  is  so,  it  is  useful  to  formalize  the  idea 
of  a  supervisor. 

By  an  estimator-based  supervisor  {cf,  Figure  8}  is  meant  a  specially  struc¬ 
tured  hybrid  dynamical  system  whose  output  a  is  a,  switching  signal  taking  values 
in  Q  and  whose  inputs  are  v  and  y.  Internally  such  a  supervisor  consists  of  three 
subsystems:  a  state-shared  estimator  Ee,  a  performance  weight  generator  Ew 
and  a  switching  logic  Es^  Ew  is  a  causal  dynamical  system  whose  inputs  are  xe 
and  y  and  whose  state  and  output  IT  is  a  “weighting  matrix”  which  takes  values 
in  a  linear  space  W.  W  together  with  a  suitably  defined  performance  function 
77  :  W  X  P  ^  IR  determine,  for  each  p  E  V,  scalar-valued  performance  signals 
of  the  form 


TTp  =  n(w,p),  pev 


(24) 
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Fig.  7.  State-Shared  Estimator-Based  Supervisor 


These  performance  signals  play  the  same  role  as  before;  i.e.,  tt^  is  considered  to 
be  a  measure  of  the  expected  performance  of  control  signal  p.  One  possible  pair 
of  definitions  for  Ew  and  77  is 

W  = -2XW (25) 

L  2/  J  L  2/  J  ^ 

and 

n{W,p)  =  [cp  ~l]W[cp  -1]'  (26) 

respectively  where  A  is  a  prespecified  nonnegative  number.  In  the  light  of  (18) 
and  (23)  it  is  eaisy  to  see  that  these  definitions  imply  that 

TTp  = -2\Trp  + e^,  p^V  (27) 

Although  we  will  deal  here  exclusively  with  such  “exponentially  weighted  £2” 
performance  signal,  it  should  be  noted  that  it  is  possible  to  realize  other  types 
of  performance  signals  by  defining  W  and  77  in  other  ways.  For  example,  if 
■P  is  a  finite  set  {say  V  —  {1,2,  ...,m}}  and  if  Sy/  is  the  dynamical  system 
d’p  =  Icpl,  P  G  P  with  state  w  =  [wi  W2  •  •  •  defining  n{w,p)  =  Wp 

would  realize  the  performance  signal  TTp  =  |ep|.  Note  however  that  if  P  were 
not  finite,  this  particular  performance  signal  could  not  be  realized  with  W  finite 
dimensional. 


Fig.  8.  Estimator-Based  Supervisor 
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Hysteresis  Switching:  There  cire  a  number  of  different  ways  to  define  switch¬ 
ing  logic  Us-  In  the  sequel  we  shall  consider  two.  The  first,  called  ‘Tlysteresis 
Switching,”  was  originally  devised  for  switching  between  the  members  of  a  finite 
family  of  parameter  adaptive  controllers  [11,  12,  13],  We  shall  ex[)lain  this  logic  s 
basic  attributes  in  the  following  manner. 

Suppose  {fg  :  ^  G  <2}  is  a  family  of  functions  fg  :  x  [0,oc))  — >  IR'T  Our 

aim  is  to  study  the  behavior  of  the  dynamical  system 

^  =  x(0)  -  X’o  (28) 

where  cr  is  a  switching  signal  taking  values  in  Q^.  Suppose  that  V  —  Q,  and  that 
W  is  a  function  of  x  and  i  which  takes  values  in  W;  i.e., 

W=^g{xJ)  (29) 

As  before,  suppose  that  77:VVxP->IRisa  performance  function  and  that  for 
p  GV,  TTp  =  n{w,p)  is  a  performance  signal.  What  we  want  to  do  is  to  explain 
how  to  generate  a  switching  signal  a  which  under  certain  conditions,  converges 
to  a  value  g  G  Q  at  which  nq  is  a  bounded  signal.  The  algorithm  which  generates 
a  is  called  a  “hysteresis  switching  logic.” 

By  a  hysteresis  switching  logic  is  meant  a  hybrid  dynamical  system  Eh  whose 
input  is  W  and  whose  state  and  output  are  both  a. 


To  specify  Eh  it  is  necessary  to  first  pick  a  positive  number  h  >  0  called  a 
hysteresis  constant.  Eh's  internal  logic  is  then  defined  by  the  computer  diagram 
shown  in  Figure  9  where  for  X  G  W,  gx  denotes  a  value  of  g  G  Q  which  minimizes 
/7(X,g). 


Fig.  9.  Computer  Diagram  of  Eh 

^  For  the  set-point  control  problem  under  consideration,  x  would  represent  the  com¬ 
posite  state  {xp,  u,xe^  xc,  W}, 
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In  interpreting  this  diagram  it  is  to  be  understood  that  cr’s  value  at  each  of  its 
switching  times  t  is  its  limit  from  above  as  t  I  i.  Thus  if  ii  and  ii^i  are  any 
two  successive  switching  times,  then  a  is  constant  on  Note  that  the 

definition  of  Uh  implies  that  7Va{t){i)  ^  t  >  0,  g  E  Q  and  that 

^  q  E  Q  i  is  a  switching  time. 

The  functioning  of  Eh  is  roughly  as  follows.  Suppose  that  at  some  time  to, 
Eh  has  just  changed  the  value  of  cr  to  g.  cr  is  then  held  fixed  at  this  value  unless 
and  until  there  is  a  time  ti  >  to  at  which  tt^  +  /i  <  tt^  for  some  p  6  Q.  If  this 
occurs,  cr  is  set  equal  to  p  and  so  on. 

Note  that  since  all  the  supervisor  has  to  do  is  to  compute  values  of  p  G  'P 
which  minimize  ]J(W,p)  at  various  times,  there  is  in  principle  nothing  to  prevent 
V  from  containing  a  continuum  of  points.  Of  course  the  minimization  problems 
to  be  solved  must  be  tractable  and  the  time  it  takes  to  compute  these  minima 
needs  to  be  taken  into  account.  We  will  discuss  both  of  these  points  further  in 
the  sequel. 

For  the  present  our  objective  is  to  describe  some  of  the  properties  of  the 
closed-loop  system  determined  by  (28),  (29)  and  Eh  assuming  that  g  and  each 
fq  is  at  least  locally  Lipschitz  in  x  and  piece  wise-continuous  in  i.  Observe  that 
because  of  the  hysteresis  constant  h  and  the  assumed  smoothness  of  g  and  the 
fq,  there  must  exist  an  interval  (0,  ^i)  of  maximal  length  on  which  cr  is  constant. 
Either  this  interval  is  the  maximal  interval  of  existence  for  x  or  it  is  not  in 
which  case  x  is  bounded  on  [0,ti).  If  the  latter  is  true,  a  switch  must  occur  at 
and  again  because  of  the  hysteresis  constant  h,  the  continuity  of  x  and  the 
smoothness  of  g  and  the  fq,  there  must  be  an  interval  [^1,^2)  of  maximal  length 
on  which  a  is  constant.  Continuing  this  reasoning  we  conclude  that  there  must  be 
an  interval  [0,  T)  of  maximal  length  on  which  there  is  a  unique  pair  {x,  a}  with 
X  continuous  and  a  piecewise  constant,  which  satisfies  (28)  and  (29).  Moreover, 
on  each  proper  subinterval  [0,  r)  C  [0,T),  cr  can  switch  at  most  a  finite  number 
of  times. 

Our  aim  now  is  to  characterize  the  limiting  behavior  of  (T  as  t  — >  T.  For  this 
we  need  to  make  certain  “open-loop”  assumptions.  Let  S  denote  the  class  of  all 
piecewise-constant  functions  s  :  [0,  00)  Q.  In  what  follows,  for  each  s  E  S,  Tg 
is  the  length  of  the  maximal  interval  of  existence  for  the  equations 

i  -  fs{t){x,t),  x(0)  =  xo 

and  Xg  is  the  corresponding  solution.  We  make  the  following 

Assumption  1  (Open-Loop) 


1.  For  each  s  e  S  and  each  q  E  Q,  performance  signal  7rq{t)  =  n{g{xs{t),t),  q) 
has  a  limit  (which  may  be  infinite)  as  t  Tg. 

2.  There  exists  at  least  one  point  q*  E  Q  such  that  for  each  s  e  S,  performance 
signal  7rg*(t)  =  n{g(xs(t),t),q*)  is  bounded  on  [OjT^). 
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These  assumptions  enable  one  to  prove  the  following  [12]. 

Lemma  1  Hysteresis  Switching  .  For  fixexl  initial  state  {xq^cto)  G  x  Q, 
let  (*c,(t)  denote  the  unique  solution  to  (28)  and  (29)  until  a  the  output  of  L}}  - 
and  suppose  [O/F)  is  the  largest  interval  on  which  this  solution  is  defined.  If  the 
open-loop  assumptions  hold,  there  is  a  time  T*  <  I  beyond  which  a  is  constant 
and  no  more  sivitching  occurs.  Moreover,  7r^(T*)  bounded  on  [0,T). 

Analysis:  What  we  want  to  do  next  is  to  very  briefly  sketch  how  one  might  use 
the  Hysteresis  Switching  Lemma  to  to  analyze  the  closed-loop  behavior  of  the 
supervisory  control  system  shown  in  Figure  10. 


Fig.  10.  Supervisory  Control  System  Using  Hysteresis  Switching 


Here  is  a  globally  detectable/stabilizable  realization  of  Kq  with  state  xc, 

Ee  is  the  globally  detectable/stabilizable  estimator  defined  by  (22)  and  (23), 
and  Eh  is  a  hysteresis  switching  logic.  Assume  that  A  =  0  and  that  E\y  and 
n  are  defined  by  (25)  and  (26)  respectively.  Therefore  in  this  case,  TTp  is  the 
performance  signal 

TTp  =  Cp,  p£V  (30) 

Note  that  the  Open-Loop  Assumption  1  automatically  holds  because  all  of  the 
TTp  are  monotone  functions. 

It  can  be  shown  [24]  that  there  are  constant  vectors  h  and  h,  singly  indexed 
matrices  Ap,dp,gp,  and  Cp  and  doubly  indexed  matrices  fqp  and  Cqp  such  that 


for  all  constant  r 

=x  +  hr  (31) 

X  =  {Ai  +  bfai)x  +  d/et  (32) 

ep  =  Cpix  +  ei  peV  (33) 

V  =  faix  +  g^ei  (34) 

e'T  =  ei  +  cix  (35) 


where 
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Because  of  the  Stability  Margin  Requirement,  it  also  turns  out  to  be  true  that 
the  matrix  pairs  (c^/,  ^4/  +  6/p/),  pj  £V  are  each  detectable.  These  claims  can 
be  verified  in  a  straight  forward  manner  by  direct  analysis  of  the  equations  under 
consideration. 

What  we  want  to  do  next  is  to  very  briefly  outline  how  one  might  use  the 
Hysteresis  Switching  Lemma  to  analyze  the  closed-loop  behavior  of  the  super¬ 
visory  control  system  shown  in  Figure  10  under  the  assumption  that  for  some 
p*  £  V ,  nominal  transfer  function  i/p*  matches  or  equals  that  of  Ep.  The  ex¬ 
act  matching  assumption  provides  exactly  one  new  piece  of  information,  namely 
that  Cp*  must  go  to  zero  as  fast  as  where  —  A^;  is  the  largest  of  the  real 

parts  of  the  eigenvalues  of  Because  of  (30)  this  means  that 

lim  TTp*  (0  =  C  <  oo 

t^oo  ^  ^ 

Thus  Open-Loop  Assumption  2  is  satisfied. 

In  view  of  the  Hysteresis  Switching  Lemma  there  there  must  be  a  time  T* 
beyond  which  a  is  constant  and  no  more  switching  occurs.  Moreover, 
must  be  bounded  on  the  maximal  interval  of  existence  [0,T)  for  solution  to  the 
overall  system  of  equations  involved.  Because  switching  has  stopped,  it  can  be 
shown  that  the  solution  in  question  in  fact  exists  globally  {i.e.,  T  —  oo}. 

Suppose  that  q  is  the  final  value  of  cr.  Since  tt^  is  bounded  on  [0,  oo),  Cq  must 
have  a  finite  £^[0,  oo)  norm  because  of  (30).  Next  observe  that  for  t  sufficiently 
large  and  I  =  p* ,  (32)  can  be  written  as 


X  —  (Ap*  -f  bfqp*)x  dp*ep*  (36) 

In  view  of  the  detectability  of  (c^p*,  Ap*  -h  bfqp*),  there  must  exist  a  matrix  k 
which  stabilizes  Ap*  -h6/^p*  -f^c^p*.  Thus  because  of  (33),  (36)  can  be  rewritten 

ctS 

X  =  (Ap*  -h  bfqp*  +  kcgp*)x  —  kcp  -f  (/:  +  dp*)ep* 

Since  Ap*  -f  6/gp*  -f-  kcqp*  is  a  stability  matrix  and  both  Cq  and  Cp*  have  finite 
£^[0,  oo)  norms,  x  must  have  a  limit  of  zero  as  f  oo.  Therefore  xp  and  xc  must 
have  a  finite  limits  because  of  (31).  So  also  must  v  because  of  (34).  Moreover, 
since  x  and  Cp*  both  tend  to  zero,  so  must  eT  because  of  (35).  Therefore  y  ^  r. 
Since  y  and  v  have  finite  limits,  and  Ep's  transfer  function  is  nonzero  at  5  =  0, 
u  must  have  a  finite  limit  as  well.  In  other  words,  j/,  u,  v.xp,  and  xc  all  tend  to 
finite  limits  and  gt  — >  0. 

Note  how  detectability  has  once  again  played  a  central  role  in  the  analysis. 
Together  with  the  Hysteresis  Switching  Lemma  it  has  enabled  us  to  establish 
the  limiting  behavior  of  y,  u,  v,  xp,  and  xq  in  a  very  elementary  way. 

The  preceding  is  less  than  satisfactory  for  at  least  four  important  reasons: 

1.  If  r  /  0,  IT  will  grow  without  bound. 

2-  If  noise  and  disturbances  are  present  W  will  almost  certainly  grow  without 
bound. 

3.  The  analysis  fails  to  account  for  unmodelled  process  dynamics 


9^1 


4.  The  analysis  fails  to  account  for  computation  time;  i.e.,  the  time  it  takes  the 
supervisor  to  carry  out  the  calculations  necessary  to  select  a  new  control. 

A  possible  remedy  for  the  first  two  problems  would  be  to  introduce  a  forget¬ 
ting  factor  or  exponential  weighting  in  tlie  definition  of  TV  in  (25).  For  exam{)le, 
one  might  pick  A  >  0.  Of  course  any  such  change  would  make  the  resulting  system 
substantially  more  difficult  to  analyze  than  the  one  weVe  been  considering  since 
TTp*  would  no  longer  be  monotone  and  switching  would  not  necessarily  terminate 
in  finite  time.  Add  in  a  small  amount  of  unmodelled  dynamics,  and  the  analysis 
problem  would  become  even  more  difficult  because  it  would  no  longer  possible 
to  presume  at  the  outset  that  tends  to  zero  or  even  that  it  is  bounded.  Some 
progress  in  dealing  with  these  difficulties  has  recently  been  announced  in  [42]. 

Taking  into  account  computation  time  makes  things  even  more  difficult.  On 
the  other  hand,  the  reality  of  a  positive  computation  time  -  however  small  -  to 
some  extent  mitigates  the  need  for  hysteresis,  since  the  only  reason  for  introduc¬ 
ing  hysteresis  in  the  first  place  was  to  prevent  unbounded  chatter  [11].  Rather 
than  further  pursue  this  topic,  we  turn  instead  to  an  alternative  switching  logic 
which  takes  computation  time  directly  into  account  and  which  results  in  a  su¬ 
pervisory  control  system  which  can  be  shown  to  perform  its  function  in  the  face 
of  unmodelled  dynamics  and  exogenous  disturbances  [43]. 


Dwell-Time  Switching:  By  a  dwell-time  siottching  logic  [15]  Ud,  is  meant  a 
hybrid  dynamical  system  whose  input  and  output  are  W  and  a  respectively,  and 
whose  state  is  the  ordered  triple  {X,  r,  a}. 


w 


a 


Here  X  is  a  discrete-time  matrix  which  takes  on  sampled  values  of  FF,  and  r 
is  a  continuous-time  variable  called  a  timing  signal  r  takes  values  in  the  closed 
interval  [0,td],  where  is  a  prespecified  positive  number  called  a  dwell  time. 
Also  assumed  prespecified  is  a  computation  time  tq  <  to  which  bounds  from 
above  for  any  X  G  W,  the  time  it  would  take  a  supervisor  to  compute  a  value 
p  =  px  E  V  which  minimizes  n{X,p),  Between  “event  times”  r  is  generated 
by  a  reset  integrator  according  to  the  rule  r  —  1.  Event  times  occur  when  the 
value  of  r  reaches  either  to  ~  tc  or  to]  at  such  times  r  is  reset  to  either  0  or 
To  —  Tc  depending  on  the  value  of  Xo^s  state.  T/^’s  internal  logic  is  defined  by 
the  computer  diagram  shown  in  Figure  11  where  px  denotes  a  value  of  p  G  7^ 
which  minimizes  77(X,p). 

The  functioning  of  Xo  can  be  explained  as  follows.  Suppose  that  at  some  time 
^0,  Xo  has  just  changed  the  value  of  cr  to  p.  At  this  instant  r  is  reset  to  0.  After 
To  “  '^c  time  units  have  elapsed,  W  is  sampled  and  X  is  set  equal  to  this  value. 
During  the  next  tq  time  units,  a  value  p  —  px  is  computed  which  minimizes 
n{X,p).  At  the  end  of  this  period,  when  t  ~  to /if  n{X,px)  is  smaller  than 
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Initialize 

a 


T  =  0 


Fig.  11.  Computer  Diagram  oi  Ed 


77 (X,  cr),  then  a  is  set  equal  to  px,  ^  is  reset  to  zero  and  the  entire  process-  is 
repeated.  If  on  the  other  hand,  n[X^a)  is  less  than  or  equal  to  n{X^px)^  t 
is  reset  to  Tj[)  —  tc,  W  is  again  sampled,  X  takes  on  this  new  sampled  value, 
minimization  is  again  carried  out  over  the  next  tq  time  units . and  so  on. 

Note  that  Eo  is  scale  independent  in  that  its  output  <j  remains  unchanged 
if  its  performance  function- weighting  matrix  pair  (77,  W)  is  replaced  by  an¬ 
other  performance  function-weighting  matrix  pair  (77,  W)  satisfying  n{W,p)  — 
9n{W,p)^  p  G  'P,  where  0  :  [0,oo)  IR  is  a  positive  time  function.  This  is 
because  for  any  fixed  t,  the  values  of  p  which  minimize  n{W{t),p)  are  exactly 
the  same  as  the  values  of  p  which  minimize  0 {t) IT {W (t) ^  p) . 

Let  us  agree  to  call  a  piecewise-constant  function  a  :  [0,oo)  -4  V  admissible 
if  it  either  switches  values  at  most  once,  or  if  it  switches  more  than  once  and  the 
set  of  time  differences  between  each  two  successive  switching  times  is  bounded 
below  by  a  positive  number  p.  The  supremum  of  such  values  of  p  is  cr’s  dwell 
time.  Because  of  the  definition  of  Ed,  it  is  clear  its  output  a  will  be  admissible 
with  dwell  time  no  smaller  than  that  of  Eo^  This  means  that  switching  cannot 
occur  infinitely  fast  and  thus  that  existence  and  uniqueness  of  solutions  to  the 
differential  equations  involved  is  not  an  issue. 

Analysis:  What  we  want  to  do  next  is  to  very  briefly  outline  how  one  might  ana¬ 
lyze  the  closed-loop  behavior  of  the  supervisory  control  system  shown  in  Figure 
12  under  the  assumption  that  for  some  p*  E  nominal  transfer  function  Up* 
matches  or  equals  that  of  Ep.  Unlike  the  supervisory  control  system  consid¬ 
ered  in  the  last  section,  we  will  not  {and  probably  cannot}  prove  that  switching 
terminates  in  finite  time. 


9G 


Fig.  12.  Supervisory  Control  System  Using  Dwell-Time  Switching 


For  the  present  we  continue  to  assume  A  —  0  and  that  Uw  and  fl  are  defined 
by  (25)  and  (26)  respectively.  Thus 

T^p  =  el,  P^'P  (37) 

just  as  before.  For  simplicity  we  focus  only  on  the  case  in  which  "P  is  a  finite  set. 
Since  (31)-(35)  still  hold,  we  can  write 


(38) 

and 

X  =  {Ap^  +  bfap^  )x  +  dp*  tp*  (39) 

Cp  =  Cpp*a:  +  Cp.  peV  (40) 

—  lap*  ^  H“  9g^p*  (df  ) 

ex  =  ep*  -j-  Cp*x  (42) 


Since  the  exact  matching  hypothesis  implies  that  Cp*  goes  to  zero  as  fast  as 
6"^^^  it  must  be  true  that  the  set 


V* 


/i 


'^dt  <  oo, 


per 


(43) 


is  nonempty.  The  assumption  that  P  is  a  finite  set  can  be  used  to  prove  that 
there  must  be  a  finite  time  P  beyond  which  a  takes  values  only  in  V*  [24]. 

Let  \^p ]^p*  t  ^p'ip* 


•  •  ■ ,  Cp^p*  }  be  a  basis  for  the  span  of  {cpp*  :  p  G  P*  }■  Define 


e  =  Cx 


(44) 


These  definitions  together  with  (40)  imply  that  Cp^  — Cp*  is  the  ith  entry  of  e.  Since 
each  such  entry  has  a  finite  £^[0,  00)  norm,  e  must  have  a  finite  P“[0,  00)  norm 
as  well.  Note  also  that  the  definition  of  C  implies  that  there  must  be  a  bounded 
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function  s  :  V*  ^  for  which  s(p)C  ~  Cpp* ,  p  G  P”"-  In  view  of  this  and 

the  previously  noted  detectability  of  the  matrix  pairs  {cpi,Ai  +  6/p/),  pj  GV, 
it  must  be  that  the  matrix  pair  (C,Ap*  +  6/pp*)  is  detectable  for  each  pGV* . 

Note  that  for  any  appropriately  sized,  matrix  p  \ — )■  Kp  which  is  bounded  on 
P*,  (39)  can  be  rewritten  as 

X  —  (tI-p*  ”h  ^fap*  “h  -Ra^)^  H”  ^p*  ^p* 

for  t  >t*.  Suppose  that  such  a  function  Kp  can  be  shown  to  exist  for  which  the 
time-varying  matrix  Ap>  -f  bf^p*  +  KaC  is  exponentially  stable.  Then  because  e 
and  Cp.  have  finite  £^[0,  oo)  norms,  x  would  tend  to  zero.  Hence  xc  and  xe  would 
tend  to  finite  limits  because  of  (38).  Moreover  since  ep*  tends  to  zero,  (41)  and 
(42)  would  imply  that  v  and  ex  tend  to  zero  as  well.  As  a  consequence,  y  would 
tend  to  r  and  u  would  tend  to  finite  limit;  the  latter  would  be  true  because  of 
the  converging  of  y  and  v  to  constant  values  and  because  Up's  transfer  function 
is  nonzero  at  s  =  0.  In  other  words,  to  show  that  y  r  and  that  xc,  xp  and  u 
tend  to  finite  limits  its  enough  to  show  that  Ap*  +  6/ap*  +  KaC  is  exponentially 
stable  for  some  suitably  defined  function  Kp, 

We  claim  that  a  function  Kp  exists  provided  td  is  sufficiently  large.  To  un¬ 
derstand  why  this  is  so,  first  recall  that  (C,  Ap*  +  bfpp*)  is  detectable  for  each 
p  ^  7?*  _  Thus  for  each  such  p  there  must  be  a  constant  matrix  Kp  which  stabi¬ 
lizes  Ap*  -f  6/pp*  +  KpC.  Therefore  for  each  pGV^  \i  is  possible  to  find  numbers 
Up  >  0  and  Ap  >  0  for  which 

-ffe/pp*  4- I  g(ap  — '^pO  ^  >  0 

Since  ^  is  an  upper  bound  on  the  time  it  takes  for  to  drop 

below  one  in  value,  it  is  not  surprising  that  the  state  transition  matrix  of  Ap*  + 
bfap*  +  KaC  will  be  exponentially  stable  provided 

TD  >  sup  1 

p€V*  I'^P  J 

This  in  fact  can  be  shown  to  be  true  [24] .  Thus  we  may  conclude  that  if  td  is 
chosen  large  enough,  then  u,xc  and  xp  must  converge  to  finite  limits  and  and 
y  must  tend  to  r. 


Performance  Signals:  One  of  the  problems  with  the  preceding  is  that  W  will  not 
remain  bounded  if  r  0.  One  easy  way  to  remedy  this  problem  is  as  follows. 

Under  the  exact  matching  hypothesis,  Cp*  ->  0  as  fast  as  Thus  there 

is  a  non-negative  constant  Co  such  that  (^)  <  Pick  A  G  (OjA^;). 

Let  U  and  TTp  be  defined  as  in  (24)  and  (26)  respectively,  but  rather  than  using 
(25)  to  generate  W,  use  the  equation 


Xp 

XE 

.  y . 

.  y . 

(45) 


instead.  Clearly 

TTp  —  € 

As  defined,  TTp  has  three  crucial  properties: 
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1.  For  each  p  G  V,  TTp  is  monotone  nondecreasing. 

2.  linif^co  TTp*  =  C*  <  7rp.(0)-f-  <  oo 

3.  If  P*  is  defined  as  before,  then  a  must  take  values  only  within  V"  beyond 
some  finite  l  ime. 

These  are  precisely  the  properties  needed  to  define  C  and  e  as  in  (44)  so  that  e  has 
a  finite  £^[0,oo)  norm  and  that  {C,Ap*  +  bfpp*)  is  detectable  for  each  p  e 
In  other  words,  if  one  were  to  use  (45)  to  generate  W,  then  the  convergence 
properties  oi  y.XE.^c  ^^rid  u  would  still  hold. 

Now  consider  replacing  W  with  the  “scaled”  weighting  matrix 

w  =  (46) 

Note  that  IJ{W,p}  =  e~^^^n{W,p),  p  EV.  In  the  light  of  the  scale  indepen- 
dence  property  of  Eo  noted  previously,  it  must  be  that  replacing  W  with  W  has 
no  effect  on  a  and  consequently  on  y^XE,  ^^nd  u.  The  key  point  here  is  that 
the  weighting  matrix  W  defined  by  (46)  can  also  be  generated  directly  by  the 
stable  dynamical  system 


XE 

.  y . 

.  y . 

(47) 


Moreover,  since  y  and  xe,  tend  to  finite  limits,  it  must  be  that  W  {and  therefore 
its  sampled  state  X}  tend  to  finite  limits  as  well.  Thus  at  this  point  we  may 
conclude  that  if  te  is  chosen  large  enough,  if  A  is  picked  in  (0,  A^;),  and  if  W  is 
generated  by  (25),  then  u,xc,  W  must  converge  to  to  finite  limits  and 

and  y  must  tend  to  r. 


Fast  Switching:  A  key  step  in  the  analysis  just  given  was  to  show  that  for  the 
family  of  detectable  pairs  {(C,  Ap*  +  6/pp* )  :  p  G  P*},  there  exists  a  a  bounded, 
output  injection  function  Kp  and  a  dwell  time  te  for  which  Ap*  +  bf^p*  +  FqC 
is  exponentially  stable  for  any  admissible  switching  function  a  with  dwell  time 
no  smaller  than  te^  It  turns  out  that  for  any  given  positive  dwell  time  te,  it  is 
possible  to  find  a  function  Kp  which  exponentially  stabilizes  Ap*  Fhf<jp*  +  K^C 
for  any  admissible  switching  function  <j  with  dwell  time  no  smaller  than  te  [24]. 

To  reader  should  realize  that  detectability  of  such  matrix  pairs  is  by  itself  not 
sufficient  for  the  existence  of  a  function  Kp  with  the  aforementioned  property.  To 
understand  why,  just  consider  the  situation  in  which  a  family  of  detectable  pairs 
of  the  form  {(C,  Ap)  :  p  G  P}  has  a  zero  readout  matrix  C;  in  this  case  each  Ap 
must  be  a  stability  matrix  and  Ap  +  KpC  =  Ap  for  all  Kp.  It  is  well  known  that 
if  the  Ap  do  not  commute  with  each  other,  exponential  stability  of  Aa  cannot  in 
general  be  assured  unless  te  is  large  enough;  for  an  example  see  [44].  In  other 
words,  there  are  families  of  detectable  pairs  of  the  form  {[C,Ap)  :  p  G  P)  for 
which  no  stabilizing  function  Kp  exists  if  te  is  too  small.  What’s  especially 
interesting  is  that  if  {(C,  Ap)  :  p  G  P}  is  a  family  of  observable  matrix  pairs, 
then  no  matter  how  small  te  is,  there  does  in  fact  exist  a  matrix  function  Kp 
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with  the  required  stabilizing  property.  This  is  an  immediate  consequence  of  the 
following  result  [14]. 

Squashing  Lemma:  Let  [C^A]  be  a  fixed,  constant,  observable  matrix  pair,  and 
let  tq  be  a  positive  number.  For  each  positive  number  8  there  exists  a  positive 
number  A  and  a  constant  output-injection  matrix  K  for  which 

|g(A-FiCC)i|  ^  ^>0  (48) 


The  way  to  construct  Kp  for  a  family  of  observable  pairs  such  as  {{C,Ap)  : 
p  G  V],  is  as  follows.  Pick  8  G  (0, 1),  set  tq  =  td  and  for  each  p  G  7^  use  the 
Squashing  Lemma  to  find  a  value  of  Kp  for  which 

It  can  be  shown  that  with  Kp  so  chosen,  Aa  +  will  be  exponentially  stable 
if  cr  is  any  admissible  switching  signal  with  dwell  time  no  smaller  than  tq  [24]. 

Unfortunately,  for  the  problems  of  interest  in  this  paper,  the  matrix  pairs 
in  {(OjAp*  +  ^fpp*)  •  P  C  V*}  cannot  be  assumed  to  be  observable  without  a 
definite  loss  of  generality.  On  the  other  hand,  observability  is  in  general  sufficient 
for  stabilizability  whereas  detectability  is  not.  The  way  out  of  this  dilemma  has 
been  to  make  use  of  additional  properties  of  the  matrices  under  consideration. 
A  typical  result  along  these  lines  is  the  following. 

Switching  Theorem:  Let  Aq  >  0  and  tq  >  0  6e  fixed.  Let  (C^oxn,  Anxn,  ^nxm) 
be  a  left  invertible  system.  Suppose  that  {[Cp,Fp)  :  p  £  V}  is  a  closed,  bounded 
subset  of  matrix  pairs  in  0  with  the  property  that  for  each  p  eV, 

(Cp,  Aq/  +  a  +  BKp)  is  detectable.  There  exist  a  constant  a  >  0  and  hounded, 
matrix-valued  output  injection  functions  p  t — y  Hp  and  p  i — y  Kp  on  V  which, 
for  any  admissible  switching  signal  a  :  [0,oo)  V  with,  dwell  time  no  smaller 
than  To,  causes  the  state  transition  matrix  of 

AA-KaCo^HaC^BFa 


to  satisfies 


Using  this  theorem  it  has  been  possible  prove  that  for  any  dwell  time  greater 
than  zero  and  any  value  of  A  G  (OjA^;),  the  supervisory  control  system  we’ve 
been  discussing  achieves  set-point  regulation  and  global  boundedness  [24].  It  has 
also  been  possible  to  show  that  these  results  continue  to  hold  in  the  face  of  norm 
bounded  unmodelled  dynamics  provided  A  is  further  constrained  to  be  smaller 
than  both  the  stability  margin  A5  and  the  unmodelled  dynamics  stability  margin 
Xu  [43].  Moreover  the  introduction  of  bounded  noise  and  disturbance  inputs 
cannot  destabilize  the  system. 
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4.4  Cyclic  Switching 


As  we  have  just  expiaiiied,  estimator-based  supervisors  generate  control  signals 
in  accordance  with  the  idea  of  certainty  equivaleiu'e;  i.e.,  at  each  instant  of  time 
tlie  controller  in  feedback  with  the  process  is  based  on  a  current  estimate  of  what 
the  nominal  process  model  transfer  function  is;  such  estimates  are  selected  from 
a  suitably  defined  admissible  nominal  process  model  transfer  function  set  N . 
Because  N  must  be  finitely  parameterized,  it  can  always  be  regarded  as  a  subset 
of  a  finite  dimensional  linear  space.  In  practice,  M  is  typically  chosen  to  best 
satisfy  a  number  of  conflicting  requirements.  For  example,  M  should  be  “big” 
enough  to  ensure  that  Cp  includes  a  transfer  function  model  of  the  process.  If 
M  contains  a  continuum  of  transfer  functions,  then  for  on-line  model  estimation 
{i.e.,  minimization  of  /7(fF,  p)  }  to  be  tractable,  M  should  be  convex  or  at  least 
the  union  of  a  finite  number  of  convex  sets.  Since  each  transfer  function  in  A/*  is  a 
candidate  process  model  transfer  function,  for  the  formulated  problem  to  make 
sense,  each  such  transfer  function  should  be  at  least  stabilizable  {i.e.,  without 
any  unstable  poles  and  zeros  in  common}. 

It  is  not  very  difficult  to  see  that  these  are  conflicting  requirements.  In  par¬ 
ticular,  stabilizability,  convexity  and  largeness  of  are  at  odds.  If  stabilizability 
and  largeness  are  required,  then  convexity  and  consequently  tractability  must  be 
sacrificed.  If  convexity  and  stabilizability  are  required,  then  A/*  must  be  “small.” 

A  way  out  of  this  dilemma,  which  enables  one  to  achieve  tractability  while  re¬ 
taining  stabilizability  and  largeness,  is  to  embed  j\f  in  a  larger  set  of  ‘admissible’ 
transfer  functions  M  which  is  convex,  but  which  is  not  restricted  to  have  only  sta¬ 
bilizable  transfer  functions.  Naturally  those  transfer  functions  in  M  which  are  not 
stabilizable  cannot  be  candidate  process  model  transfer  functions.  Nevertheless, 
because  of  the  tractability  issue  it  is  useful  to  consider  such  transfer  functions 
to  be  admissible  for  estimation  purposes.  Therefore  an  alternative  to  certainty 
equivalence  is  needed  for  selecting  controllers  when  such  transfer  functions  are 
encountered  during  the  on-line  estimation  process.  Such  an  alternative,  based 
on  the  concept  of  “cyclic  switching,”  has  recently  been  proposed  for  applications 
in  parameter-adaptive  control  where  the  same  problem  also  arises  [45,  14].  The 
aim  of  this  section  is  to  explain  what  cyclic  switching  is  within  the  context  of 
the  set-point  problem  we’ve  been  considering. 

We  will  be  concerned  exclusively  with  the  case  when  jV  contains  a  contin¬ 
uum  of  reduced  transfer  functions.  For  simplicity  assume  that  each  such  transfer 
function  has  the  same  McMillan  Degree  n.  This  means  that  J\f  can  be  viewed 
as  a  subset  of  the  2n-dimensional  linear  space  of  strictly  proper  {unreduced} 
rational  functions  whose  denominators  are  monic  and  of  degree  n. 

As  before  we  assume  that  P  is  a  closed,  bounded  subset  of  a  finite  dimensional 
linear  space.  Assume  in  addition  that  the  coefficients  of  Up  are  defined  on  this 
space  as  affine  linear  functions.  Assuming 
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again  realizes  this  means  that  Cp  will  also  be  an  affine  linear  function.  As  a 
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consequence,  the  parameterized  performance  signal  n{W,p)  defined  by 
77(iy,p)  =  [cp  -l]V^[cp  -1]' 


W  -  -2\W  + 

XE 

XB 

.  y , 

.  y . 

(49) 

(50) 


will  be  a  quadratic  function  of  p. 

We  are  interested  in  the  case  when  V  is  not  necessarily  convex  since  convexity 
of  V  would  imply  convexity  of  N ^  To  ensure  a  tractable  minimization  problem, 
we  presume  that  V  has  been  embedded  in  a  conveniently  chosen,  closed,  bounded 
convex  subset  V  {e.g.,  the  convex  hull  oiV)  and  that  the  set  of  admissible  nom¬ 
inal  transfer  functions  has  been  enlarged  toj^  =  {up  :  p  £V}.  This  reduces  the 
problem  of  minimizing  n[W,p)  over  “P  to  a  finite  dimensional  convex,  quadratic 
programming  problem.  Such  problems  are  highly  tractable  and  many  fast  algo¬ 
rithms  for  solving  them  are  known. 

We  shall  assume  that  all  of  the  points  p  G  P  {if  any}  at  which  has  a 
pole-zero  cancellation  are  in  the  interior  of  a  specified  closed  set  5  C  P,  called 
a  singular  region.  {Therefore  jUp  can’t  have  any  pole-zero  cancellations  on  the 
closure  of  P  -  5.}  It  is  reasonable  to  require  Cp  and  {z^^  :  p  G  «S}  to  be  disjoint. 

In  the  sequel  we  will  define  a  generalized  supervisor  whose  decision  making 
strategy  takes  into  account  the  possibility  that  there  may  be  times  at  which  the 
best  possible  admissible  transfer  function  estimate,  determined  by  minimizing 
IT{W,p)  over  P,  falls  within  the  singular  set  {z/p  :  p  G  «S}.  To  define  such  a 
supervisor  two  things  are  needed: 

Controller  Requirements: 

1.  A  bounded  set  of  controller  transfer  functions  {/Cg  :  q  G  (P  —  «S)}  which 
satisfies  the  Stability  Margin  Requirement  on  P  —  <S;  i.e.,  for  each  p  G  (P  —  5) 
the  real  parts  of  the  closed-loop  poles  of  the  feedback  interconnection  shown 
in  Figure  13  are  less  than  —  A5. 


Fig.  13.  Feedback  Interconnection 


Since  jz/p  has  no  pole-zero  cancellations  on  the  closure  of  P  —  5,  such  a 
family  clearly  exists. 

2.  A  set  of  real  gains  {gi,g2,  •  •  ‘j9ns}  which  fulfills  the  Observation  Require¬ 
ment]  i.e.,  for  each  /  G  P  and  each  p  E  Sy  there  is  a  value  of  q  G  {1,  2, . . . ,  775} 
for  which  the  feedforward  interconnection  of  controllable,  observable  realiza¬ 
tions  of  z/p  and  z//  shown  in  Figure  14  is  observable  through  ejf.  It  can  be 
shown  that  such  a  family  exists  because  of  the  assumed  disjointness  of 
and  S  [14]. 
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Fig.  14.  Feedforward  Interconnection 


Assume  that  {Kg  :  q  e  {V  -  S)]  is  a,  bounded  set  of  controller  transfer  func¬ 
tions  which  satisfies  the  Stability  Margin  Requirement  and  that  K2,  . .  - ,  Kns) 
is  a  finite  family  of  gains  which  satisfies  the  Observation  Requirement.  In  addi¬ 
tion,  adopt  the  notation  X  =  {1,2, . .  .,^5}  and  write  Q  for  the  disjoint  union 
0  =  {p-S)\Jl.  Suppose  Xi'ciq)  is  a  globally  detectable/stabilizable  realization 
of  Kg  on  Q. 

The  overall  structure  of  the  supervisory  control  system  we  want  to  consider 
is  the  same  as  before,  except  that  now  instead  of  the  supervisor  uses  a 
yet-to-be-defined  “dwell-time/cyclic  logic”  XJdc- 


Fig.  15,  Supervisory  Control  System  Using  Dwell-Time/Cyclic  Switching 


Edc  is  essentially  a  combined  version  of  Eo  and  the  cyclic  switching  logic  of  [14]. 
The  underlying  strategy  upon  which  logic  is  predicated  can  be  explained 

roughly  as  follows.  Consider  again  the  by  now  familiar  equations 


^  ^x-^hr  (51) 

\ 

X  =  {Ai  -h  hfai)x  +  diet  (52) 

Cp  —  Cpix  ei  p  eV  (53) 

V  =  faix gaei  (54) 

gt  =  e/  +  cix  (55) 


which  hold  for  all  constant  r  and  all  l,p  e  V.  T’dc's  strategy  stems  from  two 
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facts,  each  a  direct  consequence  of  one  of  the  two  corresponding  Controller 
Requirement  stipulated  above: 

1.  For  each  /  C  V  and  each  p  ^  V  —  S  the  matrix  pair  {cpi,Ai  +  bfqi)\q=p  is 
detectable. 

2.  For  each  /  E  V  and  each  p  E  S  there  exists  a  9  E  2  such  that  {cpi,  Ai  +  bfqi) 
is  detectable. 

Intuition:  Here  roughly  is  the  idea  upon  which  cyclic  switching  is  based.  Think 
of  the  supervisor  as  performing  two  separate  tasks  -  one  estimation  and  the  other 
controller  selection.  The  estimation  task  amounts  to  minimizing  n{W,p)  over 
P  and  goes  on  over  and  over  without  interruption;  this  generates  a  sequence  of 
values  p  £V.  Meanwhile  the  supervisor  tries  to  select  controller’s  in  such  as  way 
as  to  maintain  “detectability”  through  at  least  on  the  average.  Why?  Because 
detectability  through  implies  smallness  of  x  whenever  is  small  -  and  small¬ 
ness  of  ought  to  be  a  consequence  of  the  estimation  process.  So  here’s  how 
the  supervisor  achieves  “detectability”  through  e^:  If  p  E  'P  —  5,  the  supervisor 
relies  on  property  1  above  and  certainty  equivalence:  detectability  is  achieved  by 
setting  a  —  p.  On  the  other  hand,  if  p  enters  5,  the  supervisor  relies  on  property 
2:  in  this  case  “detectability”  is  achieved  on  the  average  by  stepping  a  through 
each  of  the  values  J,  holding  fixed  on  each  such  value  for  a  prespecified  amount 
of  time. 

Formally  a  Dwell-Time/ Cyclic  Switching  Logic  Udc  is  a  hybrid  dynamical 
system  whose  input  and  output  are  W  and  a  respectively,  and  whose  state  is 
the  ordered  quintuple  {A",p,  r,  X  is  a  discrete-time  matrix  which  takes  on 

sampled  values  of  IT,  p  is  a  discrete-time  variable  taking  values  in  P,  r  is  a 
continuous-time  timing  signal  as  before,  and  /?  is  a  logic  variable  taking  values 
in  {0,  1}.  r  takes  values  in  the  closed  interval  [0,  max{ 725 r5,  tu}],  where  tj)  and 
Ts  are  a  prespecified  positive  numbers  called  a  dwell  time  and  a  cycle  dwell  time 
respectively.  As  before  vc  <  rmix{nsTs,T]y}  is  a  prespecified  computation  time 
which  bounds  from  above  for  any  X  E  FV,  the  time  it  would  take  the  supervisor 
to  compute  a  value  p  —  px  E  P  which  minimizes  n{X,p).  Between  “event 
times”  T  is  generated  by  a  reset  integrator  according  to  the  rule  f  =  1.  Such 
event  times  occur  for  (d  E  {0,  1},  when  the  value  of  r  reaches  either  T[P)  ~  tq 

or  T[(d)  where  T(0)  =  and  T(l)  =  tisTs;  at  such  times  r  is  reset  to  either 
0  or  T{I3)  —  Tc  depending  on  the  value  of  state.  internal  logic  is 

defined  by  the  computer  diagram  shown  in  Figure  16  where  px  denotes  a  value 
of  p  E  P  which  minimizes  77(X,p). 

The  functioning  of  Edc  can  be  explained  as  follows.  Suppose  that  at  some 
time  to,  Es  has  just  changed  the  value  of  p.  Depending  on  whether  p  E  5  or 
not,  one  of  two  different  epochs  can  occur: 

-  Suppose  p  ^  5.  In  this  case  (3  is  set  equal  to  0,  cr  is  set  equal  to  p  and  r 
is  reset  to  0.  After  To  —  rc  time  units  have  elapsed,  W  is  sampled  and  X 
is  set  equal  to  this  value.  During  the  next  tc  time  units,  a  value  p  —  px  is 
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Fig.  16.  Dwell-Time/Cyclic  Switching  Logic  Udc 


computed  which  minimizes  n{X,p).  At  the  end  of  this  period,  when  r  td, 
if  n{X,px)  is  smaller  than  77(X,p),  then  p  is  set  equal  to  px  and  the  logic 
goes  back  to  again  test  whether  or  not  p  G  If,  on  the  other  hand,  77(A',p) 
is  less  than  or  equal  to  n{X,px),  r  is  reset  to  td  -  rc ,  W  is  again  sampled, 
X  takes  on  this  new  sampled  value,  minimization  is  again  carried  out  over 
the  next  rc  time  units . and  so  on. 

-  Suppose  p  G  <S.  In  this  case  (3  is  set  equal  to  1,  r  is  reset  to  0,  and  two  distinct 
sequences  of  events  occur  simultaneously,  each  lasting  nsTs  time  units: 
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1.  At  r  =  0,  a  switching  cycle  is  executed^. 

2.  kir  —  usTs  —  rc.W  IS  sampled  and  X  is  set  equal  to  this  value.  During 
the  next  re  time  units,  a  value  p  =  px  is  computed  which  minimizes 
n[X,p).  At  the  end  of  this  period,  when  r  =  nsrs,  if  77(A,px)  is 
smaller  than  ri[X,p),  then  p  is  set  equal  to  px  and  the  logic  goes  back 
to  again  test  whether  or  not  p  G  S.  If,  on  the  other  hand,  n{X,p)  is 
less  than  or  equal  to  77(A,px),  r  is  reset  to  0,  another  switching  cycle 
is  executed  ....and  so  on. 

An  analysis  of  the  supervisory  control  system  just  described  can  be  found  in 
[46].  For  analyses  in  more  traditional  adaptive  control  contexts  see  [14,  22,  47]. 
The  techniques  used  [14,  22]  are  similar  to  those  outlined  in  the  last  section.  The 
discrete  time  case  is  completely  analyzed  in  [47]. 

5  Switched  Linear  Systems 

Existing  results  concerned  with  the  types  of  switched  systems  we’ve  been  dis¬ 
cussing  deal  mainly  with  questions  of  stability,  global  boundedness  and  conver¬ 
gence.  Interesting  as  they  may  be,  these  results  are  in  many  ways  less  than  one 
might  hope  for.  Especially  lacking  we  think,  are  results  of  a  more  quantitative  na¬ 
ture.  What  are  needed  are  good  norm  bound  estimates  for  allowable  unmodelled 
dynamics.  Also  needed  is  a  clearer  understanding  of  the  relationships  between 
these  estimates  and  design  parameters. 

Resolution  of  issues  such  as  these  calls  for  a  better  understanding  of  the  basic 
properties  of  switched  systems  than  we  have  at  present.  Needed  is  a  catalog  of 
basic  results  analogous  to  those  for  non-switched  linear  systems.  In  the  sequel 
we  briefly  discuss  some  of  the  technical  questions  suggested  by  types  of  problems 
we’ve  been  discussing. 


5.1  Stability  of  Switched  Linear  Systems 

Let  V  be  either  a  finite  set  or  a  closed,  bounded  subset  of  a  finite  dimensional 
linear  space  and  let  A  =  {Ap  :  p  G  'P}  be  a  closed,  bounded  set  of  real  n  x  n 
matrices.  Within  this  context  one  can  formulate  a  number  of  different  stability 
problems.  For  example,  one  can  seek  to  find  a  switching  logic  Z's,  with  input  x 
and  piecewise-constant  output  a  which  uniformly  exponentially  stabilizes 


X  =  AaX  (56) 

in  the  sense  that  there  are  positive  constants  a  and  A  such  that  all  solutions  x 
to  (56)  {in  closed-loop  with  Xs}  exist  and  are  norm  bounded  in  time  by  ae~^^ 

^  The  supervisor  executes  a  switching  cycle  at  clock  time  r  =  0  by  setting  cr{tQ  +  r)  = 
s(r),  r  G  [0,  nsTs)  where  to  is  the  actucil  time  r  was  reset  to  0  and  s  :  [0,  nsrs)  X 
is  the  piecewise-constant  function  whose  value  is  i  on  the  subinterval  [{i  —  l)rs,  iVs), 
i  G  I. 


i06 


times  the  initial  normed  value  of  x.  The  synthesis  of  such  a  state  driven  swi telling 
logic  seems  to  be  very  challenging.  See  [48]  for  a  discussion  of  some  recent  results 
along  these  lines. 

A  somewhat  less  ambitious  goal  would  be  as  follows:  For  a  given  set  A  — 
{-4p  :  p  E  P}  and  a  given  class  of  switching  signals  S,  find  conditions  under 
which  there  exist  positive  numbers  A  and  a  such  that  for  each  a  £  S  the  state 
transition  matrix  of  satisfies 

l^(^,r)|  <  t>T>0  (57) 

Interesting  choices  for  S  would  include 

1.  =  all  piecewise  constant  switching  signals 

2.  ^2  =  all  piecewise  constant  switching  signals  with  dwell  times  no  less  than 
some  positive  number  To- 

3.  53  =  the  type  of  switching  signals  generated  by  Sdc  {cF,  [46]}. 

Here  are  some  easy  to  derive  sufficient  conditions  under  which  there  exist  a  and 
A  such  that  (57)  holds  for  every  piecewise  constant  a  G  Si 

1.  There  exists  a  norm  ||  ■  ||  on  and  a  positive  number  A  such  that 

Heap'll  <  t>0,  per 

2.  The  elements  of  A  share  a  common  quadratic  Lyapunov  function;  i.e.,  there 
exists  a  positive  definite  matrix  Q  such  that 

QELp  +  yi;g<0,  ypev 

3.  V  {and  therefore  A}  are  finite  sets  and  ApAg  =  AqAp  for  all  p,  g  E  7^- 

4.  The  matrices  in  A  are  row  diagonally  dominant  with  negative  diagonal  ele¬ 
ments  ;  i.e.,  for  ^  [a^j]  E  A, 

n 

2aii  +  ^  \aij\  <0,  i  E  {1, 2,  .  .  .n] 

j=\ 

5.  There  exists  an  integer  n  >  n,  a  full  rank  fi  x  n  matrix  V  and  a  family  of 
h  X  h  matrices  ^  =  {^p:pE7^}  such  that 

(a)  =  per 

(b)  the  matrix  Aa  is  exponentially  stable  for  each  cr  G  Si 

The  sufficiency  of  conditions  1  and  2  are  more  or  less  obvious;  the  same  is 
true  of  condition  3  since  it  implies  that  the  associated  matrix  exponentials  of  the 
.4p  commute.  An  explicit  construction  is  given  in  [49]  of  a  matrix  Q  satisfying 
condition  2  for  a  set  of  matrices  A  satisfying  condition  3.  Recently  condition  1 
was  shown  to  be  necessary  for  (57)  to  hold  for  ail  a  G  Si  [50];  of  course  one  is 
still  faced  with  the  problem  of  deciding  when  such  a  norm  exists.  Condition  4 
can  be  easily  established  using  an  simple  estimate  based  on  the  idea  of  a  matrix 
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measure  {c.f.,  [51],  p.  47}.  Condition  5  is  a  consequence  of  the  fact  that  for  any 
solution  X  to  X  ~  AaX,  X  ~  Vx  is  a.  solution  to  x  =  A^jX 

Note  that  conditions  4  and  5  imply  that  the  following  is  sufficient  for  (57)  to 
hold  for  all  a  £  S\ 

Property:  There  exists  an  integer  h  >  n,  a  full  rank  h  x  n  matrix  V  and  a 
family  of  h  x  h  matrices  A  =  {Ap  :  p  EV}  such  that 

1.  ApV  =  VAp,  per 

2.  each  matrix  A^  is  row  diagonally  dominant  with  negative  diagonal  elements. 

Under  certain  conditions  this  property  proves  to  be  necessary  as  well  [52];  of 
course  one  still  needs  to  figure  out  when  V  and  the  Ap  exist.  For  some  results 
concerning  the  stability  of  switched  nonlinear  systems,  see  [44]. 

The  kind  of  stability  questions  we’ve  been  discussing  are  very  closely  related 
to  the  problem  of  deciding  when  for  a  given  class  of  matrices  {Mp  :p  eV}  and 
a  given  class  S  of  infinite  sequences  mapping  the  nonnegative  integers  into  V, 

i 

lim  TT  Mff(j)  =  0,  Vo-  6  iS 

i=i 

This  and  related  questions  have  been  addressed  in  [53]  and  [54]. 


5.2  Other  Questions 

Here  are  several  other  questions  involving  switched  linear  systems: 

-  Given  a  family  of  switching  signals  S  and  a  family  of  detectable  pairs 
{{Cp,Ap)  :  p  EV)  when  does  there  exist  a  matrix  function  p  \ — >  Kp  on  V 
for  which  A(j  -f  KaC^  is  ‘exponentially  stable’  for  each  a  E  S  1 

-  With  reference  to  the  preceding  suppose  /C  is  a  nonempty  class  of  matrix 
functions  p  \ — v  Kp  on  V  with  the  property  that  for  each  function  Kp  E  /C, 
Aa  +  KaCo  is  exponentially  stable  for  each  a  E  S.  For  a  given  class  of 
appropriately  sized  matrices  {F},  :  p  E  V]  compute  {or  at  least  tightly 
estimate)  the  sup  over  S  of  the  inf  over  /C  of  the  induced  £^[0,  oo)  norm  of 
the  linear  operator 


2/' — >  /  ^a{t)Ht,s)K^u)y{s)ds 
Jo 

where  (j)  is  the  state  transition  matrix  of  Aa  -f  KaCa- 
-  Given  a  class  of  linear  systems  {{Ap,  Bp,Cp,  Dp)  \pEV],  each  with  property 
P,  and  a  class  of  switching  signals  S,  when  is  it  true  that  for  each  a  E  S  the 
switched  system  {Aa ,  Ba,Ca,  Da)  also  has  property  P?  Interesting  choices 
for  P  include  stability,  stabilizahility,  passivity.  Some  findings  related  to  the 
last  of  these  appear  in  [55]. 
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6  Concluding  Remarks 


Switching  logics  such  as  those  discussed  in  this  paper  are  typically  derived  with¬ 
out  apj)ealing  to  any  foririal  notion  of  state  or  state  transition.  Explaining  such 
logics  informally  can  be  beneficial  and  we  have  sought  to  do  this  throughout  the 
paper.  On  the  other  hand  we’ve  found  that  it  is  also  worthwhile  to  go  through  the 
exercise  of  formally  modeling  such  logics  as  hybrid  systems  with  inputs,  states, 
state  transitions  and  outputs.  Formal  modeling  can  clarify  an  algorithm’s  func¬ 
tioning  by  reducing  ambiguity  and  in  so  doing  can  help  to  preclude  erroneous 
conclusions.  For  example,  when  cyclic  switching  was  first  devised  without  the 
benefit  of  a  formal  model,  it  was  thought  to  be  a  time-varying  system  which  it 
clearly  is  not.  The  formal  modeling  of  the  hysteresis  switching  logic  in  [12]  made 
it  easier  to  explain.  There  are  no  doubt  many  existing  switching  logics  whose 
behaviors  could  more  easily  grasped  if  they  were  both  explained  informally  and 
modelled  formally  as  hybrid  dynamical  systems. 

In  §2  it  was  emphasized  that  multi-controller  architectures  can  usually  be 
efficiently  implemented  as  state-shared  parameter  dependent  controllers.  There 
are  of  course  situations  when  it  is  advantageous  not  to  explicitly  compute  off¬ 
line  the  parameter-dependent  coefficient  matrices  of  such  a  controllers.  This  is 
especially  true  if  the  controller  in  question  is  of  the  certainty  equivalence  type 
and  if  the  associated  family  of  nominal  process  models  is  a  continuum.  For 
example,  suppose  that  the  set  Af  of  nominal  transfer  functions  considered  in 
§4.3  is  a  continuum;  suppose  in  addition  that  for  each  p  G  P,  controller  transfer 
function  Kp  is  to  be  designed  via  LQ-theory  applied  to  i/p.  Because  solutions 
to  matrix  Riccati  equations  depend  on  the  equation’s  coefficient  matrices  in  a 
complicated  {nonrational}  manner,  the  dependence  of  Kp  on  p  will  be  at  least  as 
complex,  even  if  lyp  depends  linearly  on  p.  What  this  means  is  that  the  problem  of 
explicitly  parameterizing  /C  assuming  an  LQ-based  controller  design  is  hopelessly 
intractable.  For  MIMO  process  models  the  problem  is  far  worse  even  for  simple 
pole-placement  designs.  There  are  at  least  two  ways  to  avoid  this  problem.  One 
is  to  compute  controller  coefficient  matrices  in  real  time;  this  is  feasible  with  a 
dwell-time  switched  supervisory  control  system  provided  the  computations  can 
be  carried  out  quickly  enough. 

Another  way  to  avoid  the  parameterization  problem  is  to  settle  for  a  smaller 
nominal  process  model  transfer  function  class  Af  containing  only  finitely  many 
elements.  In  fact  a  strong  case  can  be  made  for  doing  this  not  just  to  avoid  the 
parameterization  problem,  but  for  other  reasons  as  well.  For  example,  finiteness 
of  Af's  parameter  space  T  can  greatly  simplify  the  required  minimization  77 (IT,  p) 
over  V  even  if  T  is  very  large.  We  refer  the  reader  to  [56]  for  an  interesting 
discussion  of  how  one  would  go  about  covering  a  process  model  transfer  function 
class  of  the  form 

Cp  =  ^  H  1^1  |co  ^ 

p£V 

assuming  Af  =  {Pp  :  V}  is  a  compact  continuum  with  a  transfer  function  class 
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Cp  D  Cp  of  the  form 

Cp  =  [J  5  :  Halloo  <  ^p} 

pev 

where  'P  is  a  finite  subset  of  V  and  Cp  <  p  EV. 

One  of  the  underlying  idesis  exploited  in  section  4  is  that  modeling  uncer¬ 
tainty  can  be  dealt  with  by  switching  between  the  members  of  a  family  of  fixed 
gain  controllers.  The  idea  has  been  around  for  a  long  time.  For  example  there  is 
an  extensive  literature  on  the  “multiple- model”  {i.e.,  multi-estimator}  approach 
to  uncertainty  which  goes  back  almost  thirty  years;  see  for  example  [57]  and  the 
many  references  therein.  The  key  feature  of  the  estimator-based  approach  dis¬ 
cussed  in  §4.3  which  distinguishes  it  from  the  classical  multiple-model  approach 
is  that  switching  is  orchestrated  by  a  supervisor  using  a  logic  which  selects  con¬ 
trollers  on  the  basis  of  normed  output  estimation  errors.  Surprisingly,  a  provably 
correct  version  of  this  simple  idea  does  not  seem  to  have  found  its  way  into  the 
literature  until  quite  recently  [15]  -  this  despite  the  fact  that  the  idea  is  a  natural 
extension  of  the  original  concept  of  hysteresis  switching  [11]  which  appeared  in 
1988. 

For  the  case  when  V  contains  a  continuum  of  points,  the  dwell-time  switched, 
estimator-based  supervisory  control  discussed  in  §4.3  can  be  thought  of  as  a  form 
of  estimator-based  parameter  adaptive  control  in  which  the  supervisor  plays 
the  role  of  parameter  tuner  [58].  In  this  context,  the  concept  of  a  supervisor 
represents  a  significant  departure  from  more  traditional  estimator-based  tuning 
algorithms  which  typically  employ  recursive  or  dynamical  parameter  tuning. 
Most  closely  related  to  what  we’ve  been  discussing  seems  to  be  the  type  of 
adaptive  algorithm  studied  by  Naik,  Kumar  and  Ydstie  in  [59].  Both  the  NKY 
algorithm  and  the  dwell-time  switched  supervisor  discussed  in  §4.3  search  on 
compact  parameter  spaces;  both  are  inherently  robust  to  unmodelled  dynamics 
in  that  dynamic  normalization  [60]  is  not  employed.  Perhaps  the  most  significant 
differences  between  the  two  are  1.  that  the  NKY  algorithm  employs  recursive 
parameter  tuning  {i.e.,  pseudo-gradient/projection  search)  whereas  the  dwell- 
time  switched  supervisor  does  not  and  2.  the  dwell-time  switched  supervisor 
allows  time  for  computation  whereas  the  NKY  algorithm  does  not. 

It  is  reasonable  to  suspect  that  many  of  the  ideas  covered  in  §4.3  can  be 
successfully  applied  to  specific  classes  of  nonlinear  systems.  The  well-known  ob¬ 
stacles  to  generalization  imposed  by  a  limited  nonlinear  observer  theory  can 
almost  certainly  be  side-stepped  by  focusing  attention  on  systems  whose  states 
can  be  measured.  It  is  quite  likely  that  a  supervised  family  consisting  of  a  finite 
number  of  fixed  nonlinear  controllers  will  prove  easier  to  analyze  than  a  con¬ 
tinuously  parameterized  family  of  controllers  under  the  control  of  a  parameter 
tuner.  Understanding  such  switched  systems  calls  for  new  methods  of  analysis 
which  go  beyond  the  partial  Lyapunov  function  approach  commonly  used  in  the 
study  of  conventional  parameter  adaptive  systems. 


110 


Si  inula  t  ions 

The  general  concept  of  a  d well-time  switched,  estimator-based  supervisory  con¬ 
trol  system  described  in  this  paper  has  been  tested  in  simulation  under  a  variety 
of  conditions.  The  reader  wishing  to  experiment  with  these  simulations  can  ob¬ 
tain  Matlab  Simulink  files  via  the  internet  at  the  address 

http:/ /www.cis.yale.edu/'-^wchang/workshop.html 

The  simulations  were  designed  and  implemented  by  Wen-Chung  Chang  and  Joao 
Hespanha. 
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